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SPECIFICATION 

GLYCQSYLTR ANgFRP ACiF. TsnTrT iFTf ^""1 0 ENCODING . 

THE G LYCOSYLTRANSFERAS E^AND ME THOD OF_ 
TESTING CANCERATION USING THE NUCLEIC ACID 



5 TECHNICAL FIELD 

The present invention relates to a novel nucleic acid, 
the nucleic acid for testing canceration, and a method of 
testing the canceration of a biological sample based on a 
difference in the expression level of the nucleic acid in 
10 the biological sample; as well as, to a novel 

glycosyltransf erase and a nucleic acid encoding the 
glycosyltransf erase, and the like. 

BACKGROUND ART 

15 In recent years, attention has been given to the 

function of oligosaccharides and glycoconjugates in living 
bodies. For example, a determinant factor of a blood type 
is a glycoprotein, and one involved in the function of the 
nervous system is a glycolipid. Thus, an enzyme having the * 

20 function of synthesizing an oligosaccharide is a crucially 
important key for analyzing physiological activities 
produced by various oligosaccharides . 

A N-acetyl-D-glucosamine residue (GlcNAc) and a 
D-galactose residue (Gal), and the like, in sugar are the 

25 components of glycosaminoglycan, while they are sugar 

residueis present in various oligosaccharide structures such 
as sphingoglycolipids , mucin- type oligosaccharides, and 
asparagine- linked oligosaccharides (N- linked 
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oligosaccharides). Thus, an enzyme transferring GlcNAc or 
Gal Is a crucially Important tool for analyzing the 
function of oligosaccharides that work In various tissues 
In living bodies* 
5 For example, at least 20 types of 

N-acetylglucosamlnyltransferases having an activity of 
transferring GlcNAc have been knovm as shown in Table 1, 
each of which differs in acceptor substrate specificity 
(References 1 to 18). 

10 On the other hand, oligosaccharide synthesis is known 

to be altered with great frequency in canceration and to be 
correlated with the metastasis and malignancy of cancer 
(References 30 to 32). Their comprehensive studies 
actively conducted today, for example, analysis such as 

15 expression profiling in a variety of tissues, are also 

directed to the elucidation of a canceration mechanism, and 
discussions have often been conducted on the possibility 
that the canceration mechanism is associated with the 
expression level of a particular gene. As well known, the 

20 test of tumor markers or the like in blood and the 

identification of the other gene products involved in 
canceration, and so on, have already been conducted as 
methods of cancer diagnostic tests. Tumor markers include 
many antibodies against oligosaccharides. Among others, 

25 Immunoassay for oncogene products has often been adopted 
because of its advantage in high sensitivity. 
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As described above, the Identification of gene 
products having some Involvement In canceratlon Is expected 
to provide tumor markers useful in cancer diagnosis. If 
especially a nucleic acid found in a transcript can be used 
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as an indicator for testing canceration, only the 
identification of a transcript of a particular gene can 
sufficiently provide an indicator useful in testing 
canceration, without the need for elucidating the function 
5 of its end product^ for example, a protein. Especially the 
identification of a nucleic acid has advantages which are 
not found in immunoassay, because it can be performed on a 
DNA microarray and a nucleic acid even in small amounts can 
also be quantified after being amplified by PGR. 

10 On the other hand, the function of oligosaccharides 

in living bodies receives attention. However, the analysis 
of oligosaccharide synthesis in living bodies does not 
necessarily progress satisfactorily. This is partly 
because the mechanism of oligosaccharide synthesis and the 

15 localization of sugar synthesis in living bodies are not 

sufficiently elucidated. The analysis of the mechanism of 
oligosaccharide synthesis requires the analysis of enzymes 
synthesizing oligosaccharides, especially 
glycosyltransf erases , and analyzing which type of 

20 oligosaccharide is generated with the enzyme. Therefore, 
there also has been a growing demand for finding a novel 
glycosyltransf erase and analyzing its function. 

DISCLOSURE OF THE INVENTION 
25 In light of the above -described problems, an object 

of the present invention is to provide a tumor marker 
nucleic acid significantly altered in its transcription 
level following canceration, a nucleic acid for testing 



canceratlon that tcurgets the tumor marker nucleic acid, and 
a method of testing canceratlon using any of these nucleic 
acids . 

Another object of the present Invention Is to provide 
5 a nucleic acid encoding a novel human glycosyltransf erase 
protein, and the novel glycosyltransf erase protein In the 
analysis of a particular gene that receives attention as an 
Indicator of the canceratlon. The glycosyltransf erase 
protein of the present Invention Is especially a pl,3-N- 
10 acetyl-D-glucosamlnyltransf erase protein having an activity 
of transferring N-acetyl-D-glucosamlne to an acceptor 
substrate through pi , 3-llnkage . 

A further alternative object of the present Invention 
Is to provide a transformant expressing the nucleic acid In 
15 a host cell as well as a method of growing the transformant 
to Isolate the protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the result of quantitative real-time 

20 PGR analysis of G9 transcripts In various human tissues. 
Calibration curves for G9 and GAPDH { glycer aldehyde- 3- 
phosphate dehydrogenase) were obtained by the serial 
dilution of the respective plasmld DNAs. The expression 
level of the G9 transcript was adjusted to the GAPDH 

25 measured for equivalent cDNA. The data was shown as the 
average ± S.D. of values obtained from three experiments. 

Figure 2 shows a buffer In a reaction solution and pH 
dependence, and metal Ion dependence, which are plotted for 



the activity of a G9 polypeptide. In Figure 2A, the effect 
of pH on the activity was assayed with cacodylate (filled 
square) and HEPES (N- [ 2-hydroxyethyl]piperazine-N' - [ 2- 
ethanesulf onic acid]) (filled circle) buffers. In Figure 
5 2B, the effect of a divalent cation was assayed with 
varying concentrations of MnCl2 (filled circle), CaCl2 
(filled square), MgCla (filled triangle), ZnCla (open 
circle ) , NiS04 ( open square ) , and CdS04 ( open triangle ) . 
Figure 3 shows the result of activity measurement 

10 when oligosaccharides pyridylaminated with 2-aminopyridine 
(N-glycans) are used as acceptor substrates. ND represents 
"not detected". 

Figure 4 is an electrophoretic picture showing 
activity measurement when an al -acidic glycoprotein 

15 (orosomucoid) , ovalbumin, and ovomucoid are used as 

acceptor substrates . The reaction mixture of an enzyme 
with the substrate is treated either without glycopeptidase 
F (-) or with glycopeptidase F (+), followed by separation 
by SDS-PAGE. The gel is either stained with CBB (Coomassie 

20 Brilliant Blue) (upper picture) or subjected to 
autoradiography (lower picture). 

DETAILED DESCRIPTION OF THE INVENTION 

The present inventors attempted the isolation and 
25 purification of a nucleic acid of interest that seems to 
have high sequence homology based on the nucleotide 
sequence of a gene of an enzyme similar in action to an 
enzyme of interest. Specifically, at first, the present 



Inventors conducted a BLAST search using the sequence of a 

pi , 3-N-acetylglucosamlnyltransf erase , a glycosyltransf erase 
knovm In the art, as a query sequence and consequently 
found a genome sequence (GenBank No. AC011462} as a 
5 sequence having homology. 

Further, the present inventors successfully cloned a 
gene encoding the protein by PGR and determined its 
nucleotide sequence (SEQ ID NO: 1) and predicted amino acid 
sequence (SEQ ID NO. : 2). A gene having the nucleotide 

10 sequence of SEQ ID NO: 1 was designated as a G9 gene, and a 
protein having the amino acid sequence of SEQ ID NO: 2 was 
designated as a G9 protein. The present inventors have 
thereby completed the present invention by finding out that 
a protein encoded by the nucleic acid is a novel 

15 glycosyltransf erase and that the presence or absence of the 
expression of the nucleic acid or its expression level in 
cancerous tissues differs from that in normal tissues. The 
present inventors further allowed the expression of the 
nucleic acid obtained in the present invention by a genetic 

20 engineering technique to yield a recombinant protein. The 
investigation of the activity of a protein of the present 
invention has revealed that the protein is a pi , 3-N-acetyl- 
D-glucosaminyltransf erase protein having an activity of 
transferring N-acetyl-D- glucosamine from a donor substrate 

25 to an acceptor substrate through pi , 3-linkage . 

The present invention relates to a nucleic acid 
hybridizing under stringent conditions to a nucleotide 
sequence described in SEQ ID NO: 1 or a complementary 



nucleotide sequence thereof . 

Preferably, the nucleic acid of the present Invention 
consists of a nucleotide sequence having at least 15 
contiguous nucleotides In a nucleotide sequence described 
5 In SEQ ID NO: 1 or a complementary nucleotide sequence 
thereof . 

The nucleic acid of the present invention Is 
typically a probe or a primer. The nucleic acid of the 
present Invention can also be a tumor marker. 

10 The present Invention also relates to a method of 

testing the canceratlon of a biological sample, comprising: 
(a) using any of the above -described nucleic acids to 
measure the transcription level of the nucleic acid in a 
biological sample; and 

15 (b) diagnosing the biological sample as being 

cancerous when the transcription level of the nucleic acid 
in the biological sample significantly exceeds that in a 
normal biological sample as a control. 

According to a preferred aspect of the testing method 

20 by the present invention, the method of testing the 
canceratlon of a biological sample comprises: 

(a) using any of the above -described nucleic acids as 
a labeled probe, which is in turn brought into contact with 
a biological sample under stringent hybridization 

25 conditions to measure the transcription level of the 

nucleic acid in the biological sample based on a signal 
from the label of the hybridized nucleic acid; and 

(b) diagnosing the biological sample as being 



cancerous when the transcription level of the nucleic acid 
In the biological sample significantly exceeds that In a 
normal biological saunple as a control. 

According to another preferred aspect of the testing 
5 method by the present Invention, the method of testing the 
canceratlon of a biological sample comprises : 

(a) using the above -described primer that Is labeled 
to subject a biological sample to nucleic acid 
amplification and measuring the amount of a resulting 

10 nucleic acid amplification product; and 

(b) diagnosing the biological sample as being 
cancerous when the amount of the nucleic acid amplification 
product significantly exceeds that in a normal biological 
Scunple as a control. 

15 According to a further aspect of the testing method 

by the present invention, the effectiveness of treatment 
for cancer therapy can be examined by use of the nucleic 
acid of the present invention. 

The method of examining the effectiveness of 

20 treatment for cancer therapy by the present invention is a 
method comprising: 

using any of the nucleic acids according to the 
present Invention to measure the transcription level of the 
nucleic acid in a biological sample that has received 

25 treatment for cancer therapy and comparing its measurement 
value with that before the treatment or without the 
treatment, thereby determining whether the treatment given 
to the biological sample is effective or not. 
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A preferred aspect of the method of examining the 
effectiveness of treatment by the present Invention 
encompasses a method comprising; using the biological 
sample which has already been cancerous, and determining 
5 that treatment for cancer therapy given to the biological 
sample is effective if the transcription level of the 
nucleic acid in the biological sample that has received the 
treatment is significantly below that before the treatment 
or without the treatment. 

10 The biological sample to which the method of 

examining the effectiveness of treatment can be applied 
includes an In vivo biological sample from a non-human 
model animal as well as an In vitro biological sample 
derived from a tissue, a cell, or the like (including a 

15 human tissue or cell, or the like). Alternatively the 
biological sample to which each of the above -described 
methods according to the present invention can be applied 
is typically a sample derived from the large intestine or 
peripheral blood. 

20 In other aspects of the present invention, the 

nucleotide sequence of SEQ ID NO: 1 has 31% homology to 
those of known genes a human pi,3GlcNAc transferase 2 and a 
pl,3Gal transferase 6, and a conserved motif therein is 
close to that in a pi,3Gal transferase. The nucleotide 

25 sequence has 60% homology to that of a murine pi,3GlcNAc 

transferase 1. The predicted amino acid sequence of SEQ ID 
NO: 2 has a hydrophobic transmembrane region characteristic 
of a glycosyltransf erase at its N terminus . 



From these points of view, the nucleic acid sequence 
of SEQ ID NO: 1 presumably encodes a novel human 
glycosyltransf erase that transfers an N-acetyl-D- 
glucosamlne residue to synthesize an oligosaccharide 
5 through pi , 3-llnkage . In actuality, an enzyme protein 
having a biological activity was Isolated and purified 
therefrom and a certain activity was confirmed (Exeimples 4 
and 5 ) . 

Since a protein consisting of the amino acid sequence 

10 of SEQ ID NO: 2 has the activity of the novel 

glycosyltransf erase , providing an amino acid sequence of 
this novel protein and a nucleic acid encoding it would 
make a contribution toward satisfying diverse needs for 
them in the art. 

15 That is, the present invention also relates to a 

glycosyltransf erase protein that transfers an N-acetyl-D- 
glucosamine residue from a sugar donor substrate to a sugar 
acceptor substrate through pi,3-llnkage and relates to a 
nucleic acid encoding the protein. The typical sugar donor 

20 substrate is UDP-GlcNac, and at least a Gaipi , 4GlcNAc 
carbohydrate residue is the acceptor substrate. 

Thus, the present invention also relates to a pi,3-N- 
acetyl -D-glucosamlntyl transferase protein having an 
activity of transferring N-acetyl-D-glucosamlne from donor 

25 substrate to an acceptor substrate through pi , 3-linkage, 

wherein "P" represents an anomer having a els configuration, 
of anomers of glycosldic linkage at position 1 of the sugar 
ring. 
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Moreover^ the glycosyltransf erase protein of the 
present invention includes a glycosyltransf erase protein 
that has at least one of the following properties (a) to 
(c): 

(a) acceptor substrate specificity: 

the glycosyltransferase protein has a significant 
transferring activity for at least Bz-p-lactoside and/or 
Gaipi-4GlcNAc groups, 

wherein "Bz" represents a benzyl group, "Gal" 
represents a galactose residue, "GlcNAc" represents an N- 
acetyl-D-glucosamine residue, and "P" represents an anomer 
having a cis configuration, of anomers of glycosidic 
linkage at position 1 of the sugar ring; 

(b) reaction pH: 

the glycosyltransferase protein has a high activity 
at or around neutral; or 

(c) divalent ion requirement: 

the activity is enhanced in the presence of at least 
Mn^* or CO^*. 

The glycosyltransferase protein of the present 
invention particularly includes a glycosyltransferase 
protein that has a significant activity for an acceptor 
substrate having an N- linked oligosaccharide with four 
Gaipi-4GlcNAc groups. 

In addition, an aspect of the glycosyltransferase 
protein of the present invention includes a 
glycosyltransferase protein that has any one sequence of 
the following (A) to (C): 
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(A) any one amino acid sequence of SEQ ID NO: 2, SEQ 
ID NO: 16, or SEQ ID NO: 17; 

(B) an amino acid sequence comprising any one amino 
acid sequence of SEQ ID NO: 2, SEQ ID NO: 16, or SEQ ID NO: 

5 17 In which one or several amino acld(s) Is (are) 
substituted, deleted, or Inserted; or 

(C) an amino acid sequence having at least 40% 
Identity to any one amino acid sequence of SEQ ID NO: 2, 
SEQ ID NO: 16, or SEQ ID NO: 17. 

10 In an alternative aspect, the present Invention also 

relates to a nucleic acid encoding a pi , 3-N-acetyl-D- 
glucosaminyltransf erase protein according to any of the 
above -de scribed aspects. 

An aspect of the nucleic acid encoding a 

15 glycosyltransf erase protein of the present Invention 

encompasses a nucleic acid that comprises a full-length 
nucleotide sequence described In SEQ ID NO: 1, nucleotide 
sequence from nucleotide Nos. 76 to 1194 therein, a 
nucleotide sequence from nucleotide Nos. 97 to 1194 therein, 

20 or any of complementary nucleotide sequences thereof. Such 
a nucleic acid can be DNA. 

In a further alternative aspect, the present 
invention also relates to a vector comprising a nucleic 
acid encoding a glycosyltransf erase protein as described 

25 above as well as a transformant comprising the vector. The 
present invention further relates to a method of producing 
a pi , 3-N-acetyl-D-glucosaminyltransf erase protein, 
comprising: growing the above -described transformant and 
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expressing the glycosyltransferase protein to collect the 
glycosyltransf erase protein from the transf ormant • 

In a still alternative aspect , the present invention 
can provide an antibody recognizing a pi, 3-N-acetyl-D- 
glucosamlnyltransf erase protein according to any of the 
above -described aspects. 

The finding that the nucleic acid of the present 
invention encodes a novel glycosyltransferase protein 
suggests that the expression level of the 

glycosyltransferase protein in a cancer tissue exceeds that 
in a normal tissue. Accordingly, it may be possible to 
test the canceration of a biological sample by detecting or 
quantifying the protein of the present invention that is 
expressed in the biological sample to compare its result 
with that of a normal biological sample as a control. 

Thus, the present invention also relates to a method 
of testing the canceration of a biological sample, 
comprising the steps of: 

(a) detecting or quantifying the novel 
glycosyltransferase protein of the present invention in a 
biological sample; and 

(b) diagnosing the biological sample as being 
cancerous when the quantified value of the 
glycosyltransferase protein in the biological sample 
significantly exceeds that of the glycosyltransferase 
protein in a normal biological sample as a control. 

Here, the use of an antibody specifically recognizing 
a glycosyltransferase protein is exemplified for detecting 
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the novel glycosyltransf erase protein. 

PREFERRED MODE OF CARRYING OUT THE INVENTION 

Hereinafter, the present invention will be described 
5 in detail in accordance with embodiments of the present 
invention. 

(1) Nucleic acid of the present invention involved in 
canceration 

The present inventors have found that the canceration 

10 of a normal tissue, for example, a human large intestine 
tissue, without the expression of a nucleic acid having a 
nucleotide sequence described in SEQ ID NO: 1 is confirmed 
to cause the nucleic acid to be expressed therein, and that 
a normal tissue, for example, peripheral blood from a 

15 patient with colorectal cancer, in which a nucleic acid 

having a nucleotide sequence described in SEQ ID NO: 1 is 
generally expressed has a significant increase in the 
expression level of the nucleic acid as compared to that in 
a normal individual. 

20 Thus, a nucleic acid consisting of the nucleotide 

sequence of SEQ ID NO: 1 or a complementary sequence 
thereof is worthy of note as a tximor marker useful in 
excunination for a transcript in a biological sample. 
According to the present invention, a nucleic acid capable 

25 of specifically hybridizing under stringent conditions to 
this tumor marker nucleic acid is provided. 

A primer or probe according to the present invention 
is typically a natural DNA fragment derived from a nucleic 
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acid having a nucleotide sequence of SEQ ID NO: 1, a 
synthetic DNA fragment designed to have a nucleotide 
sequence of SEQ ID NO: 1, or any of complementary strands 
thereof . 

5 Especially the tumor marker nucleic acid was detected 

via a BLAST search and Is transcribed as mRNA encoding a 
structural gene. In general. Its full-length ORF or a 
portion thereof can be present In a sample. From this 
point of view, using the nucleic acid as a targeting primer 

10 or probe, a desired target sequence can be selected from 

across the ORF of the nucleotide sequence of SEQ ID NO: 1. 
The primer or probe of the present Invention can be a 
partial sequence In the nucleotide sequence of SEQ ID NO: 1. 
Using the primer or probe as described above, the 

15 target nucleic acid In a biological sample can be detected 
and/or quantified as described below. Because a genomic 
sequence or the like can be targeted, the nucleic acid of 
the present Invention can also be provided as an antlsense 
primer for medical research or gene therapy. 

20 

Probe of the present Invention 

When the nucleic acid of the present Invention Is 
used as a probe, the nucleic acid Is an oligonucleotide 
with 15 bases or more, preferably 20 bases or more, 
25 selected from the nucleotide sequence of SEQ ID NO: 1 or a 
complementary strand thereof, or alternatively cDNA with a 
maximum length of a full-length ORF region (I.e., 1191 
bases: nucleotide Nos. 1 to 1191) In the nucleotide 
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sequence of SEQ ID NO: 1 or a complementary strand thereof. 

In particular, the probe of the present Invention Is 
widely useful as a reagent or a diagnostic agent for 
medical research. Considering that a nucleic acid having 
5 an exceedingly large molecular weight Is generally 

difficult to handle, a preferred base length of the probe 
Is exemplified by 50 to 500 bases, more preferably 60 to 
300 bases. 

Depending on, for example, the base length or 

10 hybridization conditions adopted, an oligonucleotide probe 
having a relatively short strand can function as a probe 
even if there is a mismatch on the order of one or several 
bases, especially one or two bases, between the 
oligonucleotide probe and the nucleotide sequence of SEQ ID 

15 NO: 1 or the complementary nucleotide sequence thereof. A 
cDNA probe having a relatively long strand can function as 
a probe even if there is a mismatch of 50% or less, 
preferably 20% or less, between the cDNA probe and the 
nucleotide sequence of SEQ ID NO: 1 or the complementary 

20 nucleotide sequence thereof. 

Alternatively, when the nucleic acid of the present 
invention is a synthetic oligonucleotide, the number of 
bases therein is 15 bases or more, preferably 20 bases or 
more. Depending on the base length or hybridization 

25 conditions adopted, the synthetic oligonucleotide can 
function as a probe even if there is a mismatch on the 
order of one or several bases, especially one or two bases, 
between the synthetic oligonucleotide and the nucleotide 

- 18 - 



sequence described In SEQ ID NO: 1 or the complementary 
nucleotide sequence thereof. 

It should be understood that the oligonucleotide 
probe according to the present Invention having 15 bases In 
length could specifically hybridize under stringent 
conditions to the target nucleic acid. Those skilled In 
the art can select a suitable partial sequence with at 
least 15 bases from the nucleotide sequence of SEQ ID NO: 1 
according to various strategies concerning oligonucleotide 
probe design known In the art. Moreover, Information from 
an amino acid sequence of SEQ ID NO: 2 would be helpful In 
selecting a unique sequence likely to be suitable as a 
probe . 

"Under stringent conditions" used herein means 
hybridization under moderately or highly stringent 
conditions. Specifically, the moderately stringent 
conditions are based on, for example, the length of DNA and 
can readily be determined by those having ordinary skill in 
the art. Basic conditions are shown in Sambrook et al. , 
Molecular Cloning: A Ljaboratory Manual, 3rd ed. , Vol. 1, 
7.42-7.45 Cold Spring Harbor Laboratory Press, 2001 and 
include, for a nitrocellulose filter, the use of 
hybridization conditions comprising a prewashing solution 
containing 5xSSC, 0.5% SDS, 1.0 mM EDTA (pH: 8.0) and a 
hybridization solution containing approximately 50% 
formamide, 2xSSC to 6xSSC at approximately 40 to 50''C (or 
other similar hybridization solutions such as a Stark's 
solution in approximately 50% formamide at approximately 



42^C), and washing conditions comprising O.SxSSC, 0.1% SDS 
at approximately 60*^C. The highly stringent conditions are 
also based on, for example, the length of DNA and can 
readily be determined by those skilled In the art. In 
5 general, such conditions Include hybridization and/or 

washing at a temperature higher than that of the moderately 
stringent conditions and/or a salt concentration lower than 
that of the moderately stringent conditions and are defined 
by Involving, for example, the hybridization conditions as 

10 described above and washing In 0 .2x880, 0.1% SDS at 
approximately 63^C. Those skilled in the art would 
appreciate that a temperature and a salt concentration of a 
washing solution Is optionally adjustable according to 
factors such as the length of a probe. 

15 As described above, those skilled In the art can 

readily find and practice moderately or highly stringent 
conditions suitable for a selected probe, based on common 
general technical knowledge In various probe design methods 
and hybridization conditions known In the art as well as 

20 empirical rules that would be obtained through experimental 
means usually used. 

The probe of the present Invention Includes a labeled 
probe attached with a label such as fluorescent , 
radioactive, and blotln labels. In order to detect or 

25 confirm the probe hybridized with the target sequence. One 
example of the labeled probe according to the present 
Invention Is an oligonucleotide consisting of a nucleotide 
sequence of SEQ ID NO: 6 (which corresponds to a 
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complementary strand of a strand from nucleotide Nos. 485 
to 502 In SEQ ID NO: 1). This labeled probe can be used 
for confirming or quantifying a PGR product of the target 
nucleic acid. The labeled probe of the present invention 
5 may also be integrated in a diagnostic DNA probe kit or the 
like or may be immobilized on a chip such as a DNA 
microarray . 

Primer of the present invention 

10 When the nucleic acid of the present invention is 

used as a primer, the nucleic acid is an oligonucleotide. 
Specifically, two regions are selected from the ORF region 
in the nucleotide sequence of SEQ ID NO: 1 to satisfy the 
following conditions : 

15 1) each of the regions has a length of 15 bases or 

more, preferably 18 bases or more, more preferably 21 bases 
or more and no longer than 50 bases; and 

2) each of the regions has a G+C content of 40 to 70%. 
Single- stranded DNAs having the same nucleotide sequences 

20 as those of the two selected regions or nucleotide 

sequences complementary to those of the regions may be 
produced, or otherwise the single-stranded DNAs modified to 
maintain the binding specificity for the nucleotide 
sequences may be produced. Preferably, the primer of the 

25 present invention has a sequence completely complementary 

to a partial sequence in the ORF region of SEQ ID NO: 1 and 
however, may have a one- or two -base mismatch. 

One example of a pair of primers according to the 
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present Invention is a pair of an oligonucleotide 
consisting of a nucleotide sequence described In SEQ ID 
NO: 4 (which corresponds to a complementary strand of a 
strand from nucleotide Nos. 450 to 469 in SEQ ID NO: 1) and 
5 an oligonucleotide consisting of a nucleotide sequence 
described in SEQ ID NO: 5 (which corresponds to a 
complementary strand of a strand from nucleotide Nos. 531 
to 549 in SEQ ID NO: 1). 

A probe selected from the nucleotide sequence 

10 positioned between a pair of primers used can be employed 
for quantifying the target nucleic acid amplified by PGR. 
One example of a labeled probe for detecting a PGR product 
is an oligonucleotide consisting of a nucleotide sequence 
described in SEQ ID NO: 6 (which corresponds to a 

15 complementary strand of a strand from nucleotide Nos. 485 
to 502 in SEQ ID NO: 1). 

(2) Method of testing canceration according to the present 
invention 

According to a method of testing canceration of the 
20 present invention, the transcription level of the target 

nucleic acid in a transcript of a biological scimple can be 
measured. Its measurement result is compared with a result 
from a normal biological sample as a control . If a 
significant difference lies between those results, the 
25 biological sample can be diagnosed as being a cancerous 
tissue. 

In this testing method, a threshold normalized in 
advance on the basis of known data concerning a normal 
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biological sample may be used as a detection result for a 
normal tissue used as a control. For example, when a 
normal tissue Is not obtained from an Identical patient as 
In the test of peripheral blood, comparison with the 
5 average of values measured In normal Individuals Is 
performed . 

As used herein, the significant difference to be 
diagnosed as being cancerous means that the substantial 
presence (i.e., significant concentration) of the target 

10 nucleic acid in a subject tissue is confirmed if the target 
nucleic acid is expressed in a normal tissue as in, for 
example, peripheral blood from a patient with colorectal 
cancer, or that the concentration of the nucleic acid in a 
subject tissue significantly exceeds that in a normal 

15 tissue and preferably the nucleic acid in the subject 

tissue is not less than 1.5 times, preferably 2 times, by 
concentration greater than that in the normal tissue when 
the target nucleic acid is generally expressed in a normal 
tissue as in, for example, a colorectal cancer tissue. 

20 The method of testing canceration according to the 

present invention typically involves a hybridization assay 
and a PGR method. 

Hybridization assay 
25 Examples of a hybridization assay that can be used in 

the present invention include various hybridization assays 
well known to those skilled in the art such as a southern 
blot, northern blot, dot blot, or a colony hybridization 
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technique for a transcript extracted from a biological 
sample . 

Depending on the transcription amount of the target 
nucleic acid or a difference from a normal tissue, a 
5 testing method known in the art Including a quantitative 
hybridization assay such as dot blot or colony 
hybridization used alone or in combination with immunoassay 
may be employed when the quantification of the target 
nucleic acid or an increase in detection level is required. 

10 According to the typical hybridization assay, a 

subject nucleic acid extracted from a biological sample or 
an amplification product thereof is immobilized on a solid 
phase and hybridized under stringent conditions to a 
labeled probe to measure the label bound to the solid phase 

15 after washing. 

Every method known to those skilled in the art can be 
applied to the extraction and purification of a transcript 
from a biological sample. That purified from a biological 
sample and subjected to the hybridization assay is 

20 typically cDNA from the whole transcript of the biological 
sample. However, when canceration is determined only by 
the substantial detection of the target nucleic acid (i.e., 
when no target nucleic acid appears to be expressed in a 
normal tissue) , the use of a testing method such as in situ 

25 hybridization in no need of the purification or the like of 
a transcript would be practical for a subject tissue in 
clinical tests. 
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Testing method by nucleic acid amplification 

On the basis of "the nucleotide sequence of the 
nucleic acid of the present invention" , those skilled in 
the art can appropriately create primers based on 
5 nucleotide sequences positioned at both ends of the nucleic 
acid of the present invention or a partial region of 
interest thereof to be prepared, and readily amplify and 
prepare the region of interest by nucleic acid 
amplification reaction (e.g., PGR) using the primers. 

10 As used herein, examples of the nucleic acid 

amplification reaction include reaction requiring thermal 
cycles such as polymerase chain reaction (PGR) [Saiki R.K., 
et al.. Science, 230, 1350-1354 (1985)], ligase chain 
reaction (LGR) [Wu D.Y., et al. , Genomics, 4, 560-569 

15 (1989); Barringer K. J. , et al.. Gene, 89, 117-122 (1990); 

BaranyF., Proc. Natl. Acad. Sci. USA, 88, 189-193 (1991)], 
and transcription-based amplification [Kwoh D.Y., et al. , 
Proc. Natl. Acad. Sci. USA, 86, 1173-1177 (1989)], and 
isothermal reaction such as strand displacement 

20 amplification (SDA) [Walker G.T., et al., Proc. Natl. Acad. 
Sci. USA, 89, 392-396 (1992); Walker G.T. , et al. , Nuc. 
Acids Res., 20, 1691-1696 (1992)], self -sustained sequence 
replication (3SR) [Guatelli J.G., Proc. Natl. Acad. Sci. 
USA, 87, 1874-1878 (1990)], and Qp replicase system 

25 [Lizardi et al., BioTechnology 6, p. 1197-1202 (1988)]. 
For example. Nucleic Acid Sequence -Based Amplification 
(NASBA) reaction described in European Patent No. 0525882, 
which employs the competitive amplification of a target 
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nucleic acid with a mutant sequence Is also available. 
Preferably, the nucleic acid amplification reaction Is the 
PGR method. 

The target nucleic acid In a transcript can be 
detected using PGR method with, for example, a pair of 
primers of the present invention selected from the target 
nucleic acid. In general, the nucleic acid amplification 
method in itself, such as PGR, is well known in the art and 
is readily carried out because a reagent kit and an 
apparatus for the nucleic acid amplification method are 
commercially available. 

When the primer pair of the present invention is used 
to carry out a nucleic acid amplification method by PGR 
with the subject nucleic acid as a template, the subject 
nucleic acid present in a sample is amplified while no 
amplification takes place in a sample without the subject 
nucleic acid. Therefore, whether or not the subject 
nucleic acid is present in the sample can be determined by 
confirming the presence of an amplification product, and 
the transcription level of the subject nucleic acid, that 
is, the concentration thereof can also be determined by 
quantifying an cunplif ication product. PGR cycles when 
repeated a predetermined number of times would amplify the 
subject nucleic acid to a desired concentration. The 
nucleic acid in a normal tissue can also be measured in a 
similar way. A nucleic acid of a gene extensively and 
generally present in an identical tissue or the like, for 
example, a nucleic acid encoding glyceraldehyde- 3 -phosphate 



dehydrogenase (GAPDH) or p-actln may be used as a control. 

The subject nucleic acid may be the whole mRNA as a 
transcript extracted from a biological sample such as a 
subject tissue or cell or may be the whole cDNA reverse- 
5 transcribed from the mRNA. When mRNA as the subject 

nucleic acid Is cunpllfled, a NASBA method (3SR method, TMA 
method) using the above -described pair of primers may be 
adopted. The NASBA method In Itself Is well known and can 
readily be practiced using the pair of primers because a 

10 kit for the NASBA method is commercially available. 

An amplification product obtained by PGR method can 
be detected or quantified by procedures where the reaction 
solution after amplification is subjected to 
electrophoresis and the resulting bands are stained with 

15 ethidium bromide or the like or procedures where the 

amplification product after electrophoresis is immobilized 
on a solid phase such as a nylon membrane and hybridized to 
a labeled probe specifically hybridizing to the subject 
nucleic acid, for example, a labeled probe described in SEQ 

20 ID NO: 6, to detect the label after washing. 

When the "concentrations" of the target nucleic acids 
are compared between a subject tissue and a normal tissue, 
it is preferable to use a quantitative PGR method including 
a RT-PGR or quantitative real-time PGR method for kinetic 

25 analysis. Because the target nucleic acid previously 

purified is mRNA, a quantitative real-time RT-PGR method is 
particularly preferred. However, the quantification of the 
nucleic acid in the present testing method is not limited 



to the quantitative PCR method, and other DNA 
quantification methods known in the art such as northern 
blot, dot blot, and a DNA microarray using the above- 
described probe for a PCR product are applicable. 

It is also possible to quantify the amount of the 
target nucleic acid in a sample by carrying out 
quantitative RT-PCR using quencher and reporter fluorescent 
dyes. Because especially a kit for the quantitative RT-PCR 
is commercially available, the quantitative RT-PCR can 
readily be carried out. In addition, it is also possible 
to semi-quantify the target nucleic acid based on the 
strength of electrophoretic bands. 

Testing method for effectiveness of cancer therapy 
15 The above -described method of testing canceration 

according to the present invention can also be employed as 
a method of examining the effectiveness of cancer therapy. 
Subjects to be examined are the effectiveness of treatment 
for which the effectiveness of cancer cure should be 
20 examined as well as the effectiveness of treatment given to 
a cancerous cell or tissue or a tumor tissue or the like 
obtained from a model animal for experimental 
carcinogenesis . Such treatment includes every prescription 
such as radiotherapy in addition to the administration of 
25 an anticancer agent. The treatment is given to a cancerous 
biological sample or to the focus of an experimental model 
animal . 

According to a method of examining the effectiveness 
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of cancer therapy according to the present invention, the 
transcription level of the target nucleic acid in a 
biological scimple that has received treatment of interest 
is compared with that before the treatment or without the 
5 treatment. Alternatively, the transcription level may also 
be followed up after the treatment. The treatment can be 
assessed as being effective for cancer therapy if by using 
it the treatment causes the transcription level to be 
significantly reduced or causes an intentional rise in the 

10 transcription level to be significantly suppressed. 

Such examination includes determination about whether 
a candidate substance of an anticancer agent given to a 
cancerous tissue is effective or not, especially whether 
the candidate substance is effective or not for a lesion 

15 tissue or the like in an experimental model animal, and 
determination about whether a novel candidate anticancer 
agent is effective or not for a patient with cancer. To 
the contrary, assessment of whether carcinogenicity is 
suppressed or not in an experimental model animal designed 

20 to develop cancer, that is, whether an expected rise in the 
transcription level is significantly suppressed or not is 
also targeted. 

In the present specification, the "transcription 
level" or "transcription amount" of a nucleic acid refers 

25 to the abundance of the nucleic acid derived from a 
transcript in a fixed amount of a biological scunple. 
Because a nucleic acid can be amplified for quantification 
or the signal level of its label can be cunplified, the 
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amount of the nucleic acid measured can also be expressed 
as an amplified amount or an amplified signal level. 

In the present specification, a "subject nucleic 
acid" or "target nucleic acid" includes not only mRNA and 
5 slRNA but also every type of nucleic acid obtained with 
mRNA as a template, regardless of In vivo or In vitro 
origin . 

In the present specification, a "biological sample" 
refers to an organ, a tissue, and a cell as well as an 

10 experimental animal -derived organ, tissue, and cell, etc., 
and is preferably a tissue. The esophagus, the stomach, 
the pancreas, the liver, the kidney, the duodenum, the 
small Intestine, the large intestine, the rectum, the colon, 
and peripheral blood are concretely exemplified. Preferred 

15 are the large intestine, the rectum, the colon, and 
peripheral blood, and more preferred are the large 
Intestine and peripheral blood. The term "measurement" 
used herein encompasses any of detection, amplification, 
quantification, and semi-quantification. The application 

20 of the nucleic acid of the present invention also includes 
gene therapy • 

The testing method of the present invention is a 
method of testing the canceration of a biological sample as 
described above. The phrase "testing canceration" used 

25 herein includes testing to determine whether the biological 
sample develops cancer or not as well as testing to 
determine whether malignancy is high or not , and can be 
applied to diagnosis, therapy, and so on, for cancer in 
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medical care. Tlie term "cancer" used herein typically 
refers to the entire spectrum of malignant tiimors and 
Includes disease conditions caused by a malignant tumor. 
The testing method of the present Invention Is suitable for 
5 testing, but not limited to, cancer of the esophagus, 

gastric cancer, cancer of the pancreas, cancer of the liver, 
renal cancer, duodenal cancer, cancer of the small 
Intestine, colorectal cancer, cancer of the rectum, cancer 
of the colon, and peripheral blood. Preferred are 

10 colorectal cancer, cancer of the rectum, and cancer of the 
colon, and more preferred Is colorectal cancer. 
(3) Nucleic acid of the present Invention encoding novel 
glycosyltransf erase 

Based on the finding of the nucleic acid described 

15 above, the present Invention also provides a nucleic acid 

encoding a full-length novel glycosyltransf erase protein or 
a fragment thereof. 

The nucleic acid of the present Invention encoding a 
novel glycosyltransf erase Is a nucleic acid consisting of a 

20 nucleotide sequence described In SEQ ID NO: 1 or a 

complementary nucleotide sequence thereof, preferably a 
nucleic acid consisting of a nucleotide sequence from 
nucleotide Nos . 76 to 1194 in SEQ ID NO: 1. The nucleic 
acid of SEQ ID NO: 1 Includes those encoding an amino acid 

25 sequence of SEQ ID NO: 2, and the nucleic acid with the 

nucleotide sequence from nucleotide Nos. 76 to 1194 in SEQ 
ID NO: 1 includes nucleic acids encoding amino acid 
sequences of SEQ ID NOs : 16 and 17. A nucleic acid 
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encoding an amino acid sequence identical to any eunino acid 
sequence encoded by those nucleic acids because of codon 
degeneracy is also encompassed by the present invention. 
As previously described, these nucleic acids are nucleic 
5 acids suitable for use in, for example, the method of 
testing canceration. 

The nucleic acid of the present invention encoding 
the novel glycosyltransf erase includes both single- stranded 
and double -stranded DNAs and also includes RNA complements 

10 thereof. Examples of the DNA include naturally -derived DNA, 
recombinant DNA, DNAs chemically bonded together, DNA 
amplified by PGR, and combinations thereof. However, DNA 
is preferred in light of its stability at the time of 
preparing a vector and a transf ormant • 

15 The nucleic acid of the present invention may be 

prepared by, for example, procedures below. 

At first, a candidate gene likely to encode a homolog 
protein of a pi , 3-N-acetylglucosaminyltransf erase, a 
glycosyltransf erase known in the art, is searched, and its 

20 amino acid (polypeptide) sequence is determined: when a 
program such as BLAST is used to search a gene having a 
homology to a pi , 3-N-acetylglucosaminyltransf erase gene 
from a gene database, for example, a human genome DNA 
sequence (AC011462: Homo sapiens chromosome 19 clone 

25 CTC-435M10) and EST (expressed sequence tag, AW444713) 

sequence likely to encode its homolog protein are found. 

A complementary sequence of the nucleic acid found as 
above or a portion thereof is utilized to carry out nucleic 
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acid amplification reaction from a cDNA library or the like 
according to a standard method using a basic genetic 
engineering approach such as hybridization and nucleic acid 
amplification reaction, thereby allowing the preparation of 
5 the nucleic acid of the present Invention. Because, for 
example, an approximately 1.2-kbp DNA fragment Is obtained 
as a PGR product, this fragment can be separated by a 
method such as agarose gel electrophoresis, which screens 
DNA fragments according to their molecular weights, and 

10 then Isolated according to a standard method such as a 
method for cutting out a certain band. 

Because Its amino acid sequence Is expected, from a 
predicted amino acid sequence (SEQ ID NO: 2), to have a 
transmembrane region at the N-termlnus, the nucleic acid of 

15 the present Invention that encodes a solublllzed form of a 
polypeptide can also be obtained by preparing a region of a 
nucleotide sequence encoding a polypeptide without the 
transmembrane region. In actual experiments conducted by 
the present Inventors, the removal of a region from the 

20 N-termlnus to the 26th to 33rd amino acid of the amino acid 
sequence allowed the preparation of a polypeptide having an 
enzyme activity of Interest . Thus , a nucleic acid ^ 
consisting of a nucleotide sequence from nucleotide Nos. 76 
to 1194 or from nucleotide Nos. 97 to 1194 In SEQ ID NO: 1 

25 Is considered to contain a region encoding an active domain 
region of the enzyme protein. 

A homologous nucleic acid cloned using the 
hybridization and nucleic acid amplification reaction 
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described above has at least 50% Identity, preferably at 
least 60% Identity, more preferably at least 70% or more 
identity, even more preferably at least 80% identity, still 
more preferably at least 90 % or more identity, most 
5 preferably at least 95% identity, to the nucleotide 

sequence described in SEQ ID NO: 1. The nucleic acid of 
the present invention is a nucleic acid encoding a pi,3-N- 
acetylglucosaminyltransf erase protein. In addition to a 
nucleic acid having the nucleotide sequence described in 

10 SEQ ID NO: 1, a nucleic acid encoding a protein (or a 
portion thereof) that is similar in activity, function, 
property, or the like, also comes within the scope of the 
present invention. The activity, property, or the like, of 
a protein of the present invention is described below in 

15 detail in Section (5) G9 enzyme protein of the present 

invention. The nucleic acid of the present invention has 
the closest identity to a pi,3GlcNAc transferase 2 gene 
that is a nucleic acid encoding a pi,3-N- 

acetylglucosaminyltransf erase protein known in the art, and 
20 the identity of these two nucleic acids is 31% in the total 
length and 51% in the active domain (corresponding to SEQ 
ID NO: 17 of the present invention) • Accordingly, a 
nucleic acid that has, preferably at least 55% identity to 
the nucleotide sequence described in SEQ ID NO: 1 and 

25 encodes a pi , 3-N-acetylglucosaminyltransf erase protein 
having a similar property comes within the scope of the 
present invention . 

It is possible to determine the percentage of 
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identity by visual inspection and mathematical calculation. 
Or otherwise, the percentage of identity of two nucleic 
acid sequences can be determined by using the GAP computer 
program, version 6.0 described in Devereux et al., Nucl. 
5 Acids Res. 12: 387, 1984 and available from the University 
of Wisconsin Genetics Computer Group (UWGCG) to compare 
sequence information. Preferred default parameters for the 
GAP program include: (1) a unary comparison matrix 
(containing a value of 1 for identity and 0 for non- 
10 identity) for nucleotide and the weighted comparison matrix 
of Gribskov and Burgess, Nucl. Acids Res. 14: 6745, 1986 as 
described in Schwartz and Dayhoff, eds . , Atlas of Protein 
Sequence and Structure, pp. 353-358, National Biomedical 
Research Foundation, 1979; (2) a penalty of 3.0 for each 
15 gap and an additional penalty of 0.10 for each symbol in 
each gap; and (3) no penalty for end gaps. Other programs 
of sequence comparison used by those skilled in the art are 
also available. 

(4) Vectors and transf ormants of the present invention 
20 According to the present invention, a recombinant 

vector containing the above-described nucleic acid that has 
been isolated is provided. An example of a method for 
incorporating a DNA fragment of the nucleic acid into a 
vector such as a plasmid includes a method described in 
25 Sambrook, J. et al.. Molecular Cloning, A Laboratory Manual 
(3rd edition). Cold Spring Harbor Laboratory, 1.1 (2001). 
Conveniently, a commercially- available ligation kit (e.g., 
from TAKARA SHUZO) can also be used. The recombinant 
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vector (e.g., the recombinant plasmxd) thus obtained Is 
Introduced Into a host cell (e.g., E. coll DH5a, TBI, LE392, 
or XL-LE392 or XL-lBlue) . 

A method for introducing a plasmid into a host cell 
5 includes a calcium chloride method or calcium 

chloride /rubidium chloride method, an electroporation 
method, an electroinjection method, a method by chemical 
treatment such as PEG, and a method using a gene gun or the 
like, described in Sambrook, J. et al. , Molecular Cloning, 

10 A Laboratoiiy Manual (3rd edition). Cold Spring Harbor 
Laboratory, 16.1 (2001). 

The vector can be prepared simply by ligating a 
desired gene with* a vector for recombination (e.g., plasmid 
DNA) available in the art according to a standard method. 

15 Concrete examples of the vector used include, but are not 

limited to, pDONR201, pBluescript, pUC18, pUC19, and pBR322 
as a plasmid derived from E. coli. 

Those skilled in the art can appropriately select 
restriction ends to be compatible with an expression vector. 

20 Those skilled in the art can appropriately select an 
expression vector suitable for a host cell desired to 
express the enzyme of the present invention. Thus, it is 
preferred that the expression vector according to the 
present invention should be constructed so that regions 

25 involved in gene expression (such as promoter, enhancer, 
and operator regions) are properly arranged to allow the 
expression of the nucleic acid in a host cell of interest 
and the nucleic acid is properly expressed. The 
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construction of the expression vector can also employ a 
Gateway system (Invltrogen) which does not rec[ulre 
restriction treatment and ligation procedures. The Gateway 
system Is a system utilizing site- specif Ic recombination, 
5 which allows the cloning of a PGR product while maintaining 
Its orientation and also allows the subclonlng of a DNA 
fragment Into an expression vector properly modified. 
Specifically, an entry clone Is created from a PGR product 
and a donor vector with a BP clonase, a site-specific 

10 recomblnase, and the PGR product Is then transferred to a 
destination vector that Is capable of undergoing 
recombination between the destination vector and the entry 
clone through another recomblnase LR clonase, thereby 
preparing an expression clone compatible with an expression 

15 system. One of the features of the Gateway system Is that, 
once an entry clone Is Initially created, no laborious 
subclonlng step having procedures with restriction enzymes 
and llgases Is required. 

The type of an expression vector Is not particularly 

20 limited as long as the expression vector has the function 
of expressing a desired gene In a variety of prokaryotlc 
and/or eukaryotlc host cells to produce a desired protein. 
Examples of a preferred expression vector Include: pQE-30, 
pQE-60, pMAL-G2, pMAL-p2, and pSE420 for E. coll; pYES2 

25 (Saccharomyces) , and pPIG3.5K, pPIG9K, and pA0815 (Plchla) 
for yeast; and pFastBac, pBacPAK8/9, pBK283, pVL1392, and 
pBlueBac4.5 for Insects. 

The Incorporation of the expression vector of the 
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present invention into a host cell can give a transf ormant . 
The host cell may be a eukaryotic cell (such as a mammalian 
cell, yeast, and an insect cell) or may be a prokaryotic 
cell (such as E. coli and B. subtilis). A host cell for 
5 obtaining the transformant of the present invention is not 
particularly limited and may also be a cultured cell 
derived from humans (e.g., HeLa, 293T, and SH-SY5Y) , mice 
(e.g., Neuro2a, NIH3T3), and so on. Any of these are known 
in the art and are commercially available (e.g., from 

10 DAINIPPON PHARMACEUTICAL) or available from public research 
institutes (e.g., RIKEN Cell Bank). Alternatively, an 
embryo, an organ, a tissue, or a non-human individual may 
also be used. 

Incidentally, the nucleic acid of the present 

15 invention is a nucleic acid found in a human genome library. 
Therefore, in the present invention, by using a eukaryotic 
cell as a host cell for the transformant of the present 
invention, the "nucleic acid of the present invention" 
having a property close to that of a natural one (e.g., 

20 morphology having the addition of an oligosaccharide) will 
be obtained. Thus, it is preferable to select a eukaryotic 
cell, especially a mammalian cell, as the "host cell". The 
mammalian cell is concretely exemplified by a mouse- derived 
cell, and an animal cell is exemplified by a mouse-derived, 

25 Xenopus laevis- derived, rat-derived, hamster-derived, 
monkey -derived, or human-derived cell or cultured cell 
lines established from those cells. E. coli, yeast, or an 
insect cell used as a host cell is concretely exemplified 



by DH5a, M15, JM109, and BL21 (E. coll); INVScl 

( Saccharomyces ) , and GS115 and KM 71 (Plchla) (yeast); and 

Sf21, BmN4, and silkworm larva (Insect cell). 

When a bacterium, especially E* coll. Is used as the 
5 host cell, an expression vector Is generally composed of at 
least a promoter /operator region, an Initiation codon, a 
gene encoding a desired protein, a termination codon, a 
terminator, and a replicable unit. 

When yeast, a plant cell, an animal cell, or an 
10 insect cell is used as a host cell, it is preferred that an 
expression vector should generally contain at least a 
promoter, an initiation codon, a gene encoding a desired 
protein, a termination codon, and a terminator. For 
example, DNA encoding a signal peptide, an enhancer 
15 sequence, 5'- and 3 ' -untranslated regions of a desired gene, 
a selective marker region, or a replicable unit may 
optionally be contained therein. 

In the vector of the present invention, the preferred 
initiation codon is exemplified by a methionine codon (ATG) . 
20 Moreover, the termination codon is exemplified by a 

termination codon regularly used (e.g., TAG, TGA, and TAA) . 

The replicable unit means DNA having the ability to 
replicate its total DNA sequence in a host cell and 
includes a natural plasmid, an artificially-modified 
25 plasmid (plasmid prepared from a natural plasmid), and a 
synthetic plasmid. The preferred plasmid includes: a 
plasmid pQE30, pET, or pCAL, or artificially- modified 
products thereof (DNA fragments obtained by treating pQESO, 
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pET, or pCAL with an appropriate restriction enzyme) for 
E. coli; a plasmid pYES2 or pPIC9K for yeast; and a plasmid 
pBacPAK8/9 for an insect cell. 

Any of those usually used by those skilled in the art 
5 such as enhancer and terminator sequences each derived from 
SV40 can be used as enhancer and terminator sequences. 

Any of those usually used can be used as a selective 
marker according to a standard method. An example thereof, 
includes a gene resistant to an antibiotic such as 
10 tetracycline, ampicillin, or kanamycin or neomycin, 
hygromycin, or spectinomycin. 

The expression vector can be prepared by 
consecutively and circularly ligating at least the above - 
described promoter, initiation codon, gene encoding a 
15 desired protein, termination codon, and terminator region 
with an appropriate replicable unit. On this occasion, an 
appropriate DNA fragment (e.g., a linker, other restriction 
sites) can be used, if desired, according to a standard 
method such as digestion with a restriction enzyme and 
20 ligation using a T4 DNA ligase. 

The introduction [transformation ( transf ection) ] of 
the expression vector of the present invention into a host 
cell can be carried out using a method conventionally known 
in the art. 

25 The expression vector can be transformed into, for 

example, a bacterium (such as E. coli and Bacillus 
subtilis) by, for example, a method of Cohen et al. [Proc. 
Natl. Acad. Sci. USA, 69, 2110 (1972)], a protoplast method 
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[Mol. Gen. Genet., 168, 111 (1979)], and a competent method 
[J. Mol. Biol., 56, 209 (1971)], into Saccharomyces 
cerevisiae by, for example, a method of Hinnen et al. [Proc. 
Natl. Acad. Sci. USA, 75, 1927 (1978)] and a lithium method 
5 [J. B. Bacteriol., 153, 163 (1983)], into a plant cell by, 
for example, a leaf disk method [Science, 227, 129 (1985)] 
and an electroporation method [Nature, 319, 791 (1986)], 
into an animal cell by, for exeunple, a method of Graham 
[Virology, 52, 456 (1973)], and into an insect cell by, for 
10 example, a method of Summer et al. [Mol. Cell Biol., 3, 
2156-2165 (1983)], respectively. 

(5) Isolation and purification of enzyme protein according 
to the present invention 

In recent years, an approach in which a transformant 

15 is cultured and grown and a substance of interest is 

isolated and purified from the cultured and grown products 
has been established as a genetic engineering approach. 

The enzyme protein according to the present invention 
can be expressed (produced), for example, by culturing a 

20 transf ozmant containing the expression vector prepared as 
described above in a nutrient medium. Preferably, the 
nutrient medium contains a carbon source, an inorganic 
nitrogen source, or an organic nitrogen source necessary 
for the growth of a host cell (transformant). The carbon 

25 source is exemplified by glucose, dextran, soluble starch, 
sucrose, and methanol. The inorganic nitrogen source or 
organic nitrogen source is exemplified by ammonium salts, 
nitrates, an amino acid, corn steep liquor, peptone, casein, 
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a meat extract, soy bean cake, and a potato extract. The 
nutrient medliun may also contain other nutrients (e.g., an 
Inorganic salt (e.g., NaCl, calcium chloride, sodium 
dlhydrogenphosphate, and magnesium chloride), vitamins, an 
5 antibiotic (e.g., tetracycline, neomycin, amplclllln, and 
kanamycln)). If desired. Culture Is carried out by a 
method known In the art . Such culture conditions as 
temperature, pH of a medium, and culture time are 
appropriately selected to produce the protein according to 

10 the present Invention In large amounts. 

The protein according to the present Invention can be 
acquired from a cultured product obtained by the culture 
described above, as follows: when the protein according to 
the present Invention Is accumulated within a host cell, 

15 the host cell Is collected by a procedure such as 

centrlf ugatlon and filtration and then suspended In an 
appropriated buffer (e.g., a buffer such as Trls, phosphate, 
HEPES, and MES buffers having a concentration of 
approximately 10 to 100 mM; pH Is preferably In the range 

20 of 5.0 to 9.0, which differs depending on a buffer used), 

followed by the disruption of the cell by a method suitable 
for the host cell used to obtain contents of the host cell 
by centrlf ugatlon; whereas, when the protein according to 
the present Invention Is secreted to the outside of a host 

25 cell, the host cell Is separated from a medlxim by a 

procedure such as centrlf ugatlon and filtration to obtain a 
culture filtrate. A host cell disruption solution or 
culture filtrate can be subjected to the Isolation and 



purification of the protein either directly or after 
subjecting to ammonium sulfate precipitation and dialysis. 
A method of isolating and purifying the protein can include 
the following methods: a method by affinity chromatography 
5 suitable for each tag generally used when the protein is 
attached to a tag such as 6x hlstidlne, GST, or a maltose- 
binding protein; or alternatively a method that will be 
described in detail in Examples below, that is, a method by 
ion exchange chromatography, when the protein according to 

10 the present invention is produced without such a tag. In 
addition, the method can also include a method combining 
gel filtration, hydrophobic chromatography. Isoelectric 
chromatography, and so on. 

The enzyme protein according to the present invention 

15 is allowed to act on a glycoprotein, oligosaccharide, or 

polysaccharide, or the like, thereby transferring a certain 
sugar residue. Thus, the enzyme according to the present 
invention can be used in the modification of an 
oligosaccharide in a glycoprotein and the synthesis of 

20 saccharides. In addition, by administering this enzyme as 
an Immunogen to an animal, an antigen against the enzyme 
can be created, which can in turn be used to measure the 
enzyme by immunoassay. Thus, the enzyme and the nucleic 
acid encoding the enzyme according to the present invention 

25 are useful for creating such an immunogen. Because an 

oligosaccharide structure synthesized by this enzyme seems 
to be Increased in a cancer cell, an antibody against this 
oligosaccharide is probably available as a tumor marker. 
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Among antibodies against an oligosaccharide, for example, 
CA19-9 Is well known to be useful as a tumor marker* 
Similarly, an oligosaccharide structure capable of being a 
tiimor antigen can be synthesized using the G9. 
5 It Is preferred that the expression vector of the 

present Invention should be constructed so that the enzyme 
Is easily Isolated and purified as described above • When 
the enzyme Is prepared by a genetic engineering technique 
using the expression vector according to the present 
10 Invention that has been constructed to be expressed 

especially In the morphology of a fusion protein between a 
polypeptide having an enzyme activity and a labeling 
peptide. Its Isolation and purification would easily be 
performed. 

I5 An example of the identification (labeling) peptide 

described above is a peptide having the function of 
allowing the easy secretion/ separation/purification or 
detection of the enzyme according to the present invention 
from a grown product of a transformant by expressing the 

20 enzyme according to the present invention as a fusion 
protein between the identification peptide and a 
polypeptide having an enzyme activity bound together in the 
preparation of the enzyme by genetic recombination. 
Examples of such an identification peptide include a 

25 peptide such as a signal peptide (peptide consisting of 15 
to 30 amino acid residues, which is present in the N 
terminuses of many proteins and functions within a cell for 
sorting out a protein in intracellular transmembrane 
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mechanisms: e.g., OmpA, OmpT, and Dsb), a protein kinase A, 
a protein A (protein having a molecular weight of 
approximately 42,000, which Is a component of the cell wall 
of Staphylococcus aureus), a glutathion S- transferase, a 
5 His tag (sequence where 6 to 10 hlstldlne residues are 
aligned and arranged), a myc tag (13 -amino acid sequence 
derived from a cMyc protein), a FLAG peptide (analytical 
marker consisting of 8 amino acid residues), a T7 tag 
(which consists of the first 11 amino acid residues In a 

10 gene 10 protein), a S tag (which consists of 15 amino acid 
residues derived from pancreatic RNase A), a HSV tag, pelB 
(22 -amino acid sequence In an E. coll outer membrane 
protein pelB) , a HA tag (which consists of 10 amino acid 
residues derived from hemagglutinin), a Trx tag 

15 (thloredoxln sequence), a CBP tag (calmodulin -binding 

peptide), a CBD tag (cellulose-binding domain), a CBR tag 
(collagen-binding domain), p-lac/blu (p lactamase), p-gal 
(P galactosldase) , luc (luclf erase) , HP-Thlo (Hls-patch 
thloredoxln), HSP (heat shock peptide), Lny (lamlnln y 

20 peptide), Fn (partial peptide of f Ibronectln) , GFP (green 
fluorescent peptide), YFP (yellow fluorescent peptide), 
CFP (cyan fluorescent peptide), BFP (blue fluorescent 
peptide), DsRed and DsRed2 (red fluorescent peptides), MBP 
(maltose- binding peptide), LacZ (lactose operator), IgG 

25 (Immunoglobulin G) , avldln, and a protein G, and any of 
these Identification peptides may be used. Among them, 
especially the signal peptide, protein kinase A, protein A, 
glutathione S- transferase , His tag, myc tag, FLAG peptide, 
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T7 tag, S tag, HSV tag, pelB or HA tag Is preferred because 
the enzyme according to the present invention Is expressed 
and purified more easily according to a genetic engineering 
approach. It Is particularly preferable to obtain the 
5 enzyme according to the present Invention as a fusion 
protein with the FLAG peptide (Asp-Tyr-Lys-Asp-Asp-Asp- 
Asp-Lys) (SEQ ID NO: 6), because of considerably excellent 
handling ease. The FLAG peptide Is highly antigenic, and 
It provides an epitope reverslbly bound by a specific 

10 monoclonal antibody and allows the rapid assay and easy 

purification of a recombinant protein expressed. A murine 
hybrldoma designated as 4E11 produces a monoclonal antibody 
that binds to the FLAG peptide In the presence of a certain 
divalent metal cation, as described In U.S. Patent No. 

15 5,011,912 (Incorporated herein by reference). The 4E11 

hybrldoma cell line Is deposited In American Type Culture 
Collection under Accession No. HB 9259. The monoclonal 
antibody binding to the FLAG peptide Is available from 
Eastman Kodak Co., Scientific Imaging Systems Division, New 

20 Haven, Connecticut. 

A basic vector capable of being expressed In a 
mammalian cell and yielding the enzyme according to the 
present Invention as a fusion protein with the FLAG peptide 
Is, for example, pFLAG-CMV-1 (manufactured by Slgma- 

25 Aldrlch) . Alternatively, a vector capable of being 

expressed In an Insect cell Is exemplified by pFBIF (a 
vector In which a region encoding the FLAG peptide Is 
Incorporated Into pFastBac (Invltrogen) : see Examples 
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below) • However, those skilled In the art can select a 
suitable basic vector, judging from a host cell, a 
restriction enzyme, an identification peptide, and so on, 
used in the expression of the enzyme. 
(6) G9 enzyme protein of the present invention 

As described above, a polypeptide having a certain 
enzyme activity can be isolated and purified using the G9 
nucleic acid of SEQ ID NO: 1 of the present invention on 
the basis of. a genetic engineering approach. 

First, from the above point of view, a typical aspect 
of the protein of the present invention is a G9 enzyme 
protein having an amino acid sequence of SEQ ID NO: 2 
predicted from the nucleic acid sequence of SEQ ID NO: 1. 
Specifically, this enzyme protein has activities described 
below . 

Catalytic reaction 

The G9 enzyme protein can transfer N-acetyl-D- 
glucosamine (GlcNAc) from its donor substrate to its 
20 acceptor substrate through pi, 3 glycosidic linkage and 
synthesize an oligosaccharide. 

Donor substrate specificity : 

Examples of the N-acetyl-D-glucosamine donor 
25 substrate include a sugar nucleotide having this sugar 
residue, for example, uridine diphosphate-N- 
acetylglucosamine ( UDP- GlcNAc ) , adenosine diphosphate-N- 
glucosamine ( ADP -GlcNAc ) , guanosine diphosphate-N- 
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10 



acetylglucosamlne (GDP-GlcNAc) , and cytidlne diphosphate -N- 
acetylglucosamlne (CDP-GlcNAc) . The typical donor 
substrate is UDP-GlcNAc. 



5 Acceptor substrate specificity (see Table 5) ; 

The typical acceptor substrate exhibits a significant 
activity for Bz-p-lactoside and Gaipi-4GlcNAca-pNp and a 
particularly strong activity for Gaipi-4GlcNAca-pNp, of 
pNp-a-Glc, pNp-p-Glc, pNp-a-GlcNAc, pNp-P-GlcNAc, pNp-a-Gal, 

10 oNp-p-Gal, pNp-a-GalNAc, pNp-a-Xyl, oNp-p-Xyl, pNp-a-Fuc, 

Bz-a-Man, Bz-a-ManNAc, Bz-P-lactoside , GlcNAcpi-4GlcNAcp-BZ , 
Gaipi - 4GlcNAca-pNp • 

In the present specification, "GlcNAc" represents a 
N- acetyl -D- glucosamine residue; "GalNAc" represents a 

15 N-acetyl-D-galactosamine residue; "ManNAc" represents a 

N-acetyl-D-mannosamine residue; "Glc" represents a glucose 
residue; "Man" represents a mannose residue; "Gal" 
represents a galactose residue; "Bz" represents a benzyl 
group; "pNp" represents a paranitrophenyl group; "oNp" 

20 represents an orthonitrophenyl group; and "-" represents 

glycosidic linkage. The number in the formula represents a 
carbon ntimber of glycosidic linkage present in the 
oligosaccharide. Besides, "a" and "P" represent anomers of 
the glycosidic linkage at position 1 of the sugar ring, and 

25 the anomer trans to CH2OH or CH3 at position 5 and the 

anomer cis thereto in positional relationship are expressed 
by "a" and "P", respectively. 

Thus, the G9 enzyme protein of the present invention 
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catalyzes reaction given by, for example, the following 
formula : 

UDP-GlcNAc + Gaipi-4GlcNAca-R 

UDP + GlcNAcpi-3Gaipi-4GlcNaca-R, 
wherein R is a glycoprotein, glycolipid, oligosaccharide, 
or polysaccharide, or the like, having the GlcNAc residue. 

The G9 enzyme protein of the present invention 
exhibits the transferring activity for an oligosaccharide 
(e.g., oligosaccharide pyridy lamina ted with 2- 
aminopyridine ) or a glycoprotein having an oligosaccharide 
residue. The G9 enzyme protein exhibits a significantly 
strong activity or a selectively significant activity for 
especially a substrate having the oligosaccharide residue 
in a quadruple -stranded form at the nonreducing end of an 
N- linked oligosaccharide (see Figures 3 and 4). 

Optimum buffer and optimum pH (see Figure 2A) ; 

The G9 enzyme protein has the above -de scribed 
catalysis in both sodium cacodylate and HEPES buffers. 
Generally, it has a high activity at or around neutral. 
Concerning the pH dependence of an activity in each of the 
buffers, the G9 enzyme protein has an activity that rises 
with an increase in pH within the neutral region from pH 
6.4 to 7.2 in the sodium cacodylate buffer and shows the 
maximum activity around pH 7.0 in the HEPES buffer. 

Divalent ion requirement (see Figure 2B) ; 

The activity of the human G9 protein is significantly 
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enhanced In the presence of at least Mn Ion or Co Ion among 
divalent metal Ions and particularly remarkably enhanced In 
the presence of Mn ion. These enhanced activities rapidly 
rise in the low concentration region of the ion and 
5 subsequently gradually decrease. Although Cd and Ni ions 
show slight enhancing effect in the low concentration 
region, no enhancing effect is substantially observed in Mg 
and Zn ions. Usually, most of glycosyltransf erases 
requiring a divalent ion have reaction enhanced in the 

10 presence of Mn ion. 

As described above, the G9 enzyme protein of the 
present invention can transfer a GlcNAc residue to a 
certain oligosaccharide through pi,3 glycosidic linkage 
under the given enzyme reaction conditions described above 

15 and as such, is useful in the synthesis or modification 
reaction of an oligosaccharide of a glycoprotein or the 
like. 

Second, in the present specification, the disclosure 
of the amino acid sequence described in SEQ ID NO: 2 that 

20 represents the primary structure of the above -described 

protein provides every type of protein that can be produced 
by a genetic engineering approach well known in the art on 
the basis of the amino acid sequence (hereinafter, also 
referred to as a "mutant protein" or "modified protein"). 

25 That is, the enzyme protein of the present invention is not 
limited to a protein consisting of the predicted amino acid 
sequence of SEQ ID NO: 2 according to common general 
technical knowledge in the art and as illustrated below, is 
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Intended to Include a protein consisting of a polypeptide 
being not full-length that partially lacks, for example, 
the N- terminal side of the amino acid sequence, or even a 
protein with an amino acid sequence homologous to those 
5 amino acid sequences, which has intrinsic properties of the 
proteins . 

Initially, the human G9 enzyme protein of the present 
invention can be composed of preferably an amino acid 
sequence described in SEQ ID NO: 16 (amino acid sequence 
10 from amino acid No. 26 to the C terminus in SEQ ID NO: 2), 
more preferably an amino acid sequence described in SEQ ID 
NO: 17 (amino acid sequence from amino acid No. 33 to. the C 
terminus in SEQ ID NO: 2), as has been obtained in the 
example below. 

15 In general, it is well known that a protein having 

physiological activities such as an enzyme can maintain the 
physiological activities even if one or several amino 
acid(s) is (are) substituted or deleted in any of the above- 
described amino acid sequences or one or several amino 

20 acid(s} is (are) inserted or added to any of the cimino acid 
sequences . It is also known that there exist mutant 
proteins among native proteins, which have the mutation of 
one or several amino acid(s) due to the variety of living 
species producing them, gene mutation caused by variations 

25 in ecotype, or the presence of greatly similar isozymes, or 
the like. From this point of view, the protein of the 
present invention also includes a mutant protein that has 
an amino acid sequence comprising each amino acid sequence 
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shown in SEQ ID NO: 2, 16, or 17 in which one or several 
amino acid(s) is (are) substituted or deleted or to which 
one or several amino acid(s) is (are) inserted or added, and 
has an activity of transferring a GlcNAc residue to a Gal 
5 residue through pi , 3-glycosidic linkage under a certain 
enzyme reaction conditions described above. In addition, 
the particularly preferred modified protein includes a 
protein that has an amino acid sequence comprising each 
amino acid sequence shown in SEQ ID NO: 2 in which one or 
10 several amino acid(s) is (are) substituted or deleted or to 
which one or several amino acid(s) is (are) inserted or 
added • 

In the description above, "several" refers to 
preferably 1 to 200, more preferably 1 to 100, even more 
15 preferably 1 to 50, most preferably 1 to 20. In general, 

the numbers of amino acids that can be substituted with the 
activity of the original protein maintained when an amino 
acid is substituted by site- specif ic mutation is preferably 
1 to 10. 

20 Moreover, the modified protein of the present 

invention includes a modified protein obtained by the 
substitution of an amino acid by another having a property 
similar thereto. That is, an approach for creating a 
recombinant protein having desired mutation by introducing 

25 the substitution of an amino acid by another having a 
property similar thereto (e.g., the substitution of a 
hydrophobic amino acid by another hydrophobic amino acid, 
the substitution of a hydrophilic amino acid by another 
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hydrophillc amino acid, the substitution of an acidic amino 
acid by another acidic amino acid, or the substitution of a 
basic amino acid by another basic amino acid) is well known 
to those skilled. in the art, and most of the modified 
5 proteins thus obtained have properties similar to those of 
the original proteins. From this point of view, a modified 
protein allowed to have such amino acid substitution is 
also encompassed by the present invention. 

In addition, the modified protein of the present 

10 invention may be a glycoprotein in which an oligosaccharide 
binds to the polypeptide as long as it has the amino acid 
sequence as described above and has an enzyme activity 
intrinsic to an enzyme of interest. 

When the amino acid sequence of the present invention 

15 is subjected to an identity search provided by GENETYX for 
identifying the range of the homologous protein of the 
present invention, it is understood that the amino acid 
sequence has the highest identity to that of a pi,3-N- 
acetylglucosaminyl transferase 2 protein known in the art, 

20 and that the identity of these two amino acid sequences is 
37% in the total length and 39% in the active domain (which 
corresponds to SEQ ID NO: 17). From this point of view, an 
amino acid sequence preferred as the homologous protein of 
the present invention may generally have at least 40% 

25 identity, more preferably at least 50% identity, 

particularly preferably at least 60% identity to the amino 
acid sequence shown in SEQ ID NO: 2, 16, or 17. The pl,3- 
N-acetylglucosaminyltransf erase protein having a property 
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similar to those of proteins having the amino acid 
sequences comes within the scope of the present Invention. 

The GENETYX Is genetic Information processing 
software for nucleic acid analysis and protein analysis and 
5 Is capable of signal peptide prediction, promoter site 

prediction, and secondary structure prediction In addition 
to usual homology analysis and mult 1- alignment analysis. 
The homology analysis program used herein adopts Llpman- 
Pearson method (Llpman, D.J. & Pearson, W.R., Science, 277, 

10 1435-1441 (1985)) that Is frequently used as a method with 
high throughput and high sensitivity. 

In the present specification, percentage of Identity 
may be determined by the comparison with sequence 
Information using, for example, a BLAST program described 

15 by Altschul et al. (Nucl. Acids. Res., 25. 3389-3402 

(1997)) or FASTA described by Pearson et al. (Proc. Natl. 
Acad. Scl. USA, 2444-2448 (1988)). The programs are 
available on the Internet from the website of National 
Center for Biotechnology Information (NCBI) or DNA Data 

20 Bank of Japan (DDBJ). A variety of conditions (parameters) 
for an Identity search provided by each of the programs are 
described In detail In the website. Although a part of the 
configurations can appropriately be altered, the search Is 
generally conducted using default values. It Is noted that 

25 other sequence comparison programs used by those skilled In 
the art may also be employed. 

Third, as described below, the Isolated protein of 
the present Invention can be administered as an Immunogen 
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to an animal to thereby produce an antibody against the 
protein* The enzyme can be measured and quantified by 
Immunoassay using such an antibody. Thus, the present 
Invention Is useful for creating such an Immunogen. From 
5 this point of view, the protein of the present invention 
Includes a polypeptide fragment, a mutant, and a fusion 
protein from the protein, which contains an antigenic 
determinant or epitope for eliciting the formation of an 
antibody. 

10 (7) Antibody recognizing protein according to the present 
invention 

In the present specification, an antibody 
immunoreactlve to a glycosyltransf erase protein encoded by 
the nucleic acid of the present invention is provided. 

15 Such an antibody specifically binds to the 

glycosyltransf erase protein via the antigen-binding site of 
the antibody (in contrast to non-specific binding). Thus, 
the protein of SEQ ID NO: 2, 16, or 17, or a fragment, 
mutant, or fusion protein thereof, or the like, as 

20 described above may be used as an "immunogen" for producing 
an antibody Immunoreactlve thereto. To be more specific, 
the protein, the fragment, mutant, and fusion protein 
thereof, and so on, contain an antigenic determinant or 
epitope for eliciting the formation of an antibody. The 

25 antigenic determinant or epitope may be either linear or 

conformational (Interrupted). The antigenic determinant or 
epitope may be identified by any method known in the art. 

Thus, one aspect of the present invention relates to 
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an antigenic epitope of a glycosyltransf erase protein 
encoded by the nucleic acid of the present Invention. Such 
an epitope Is useful for creating an antibody, especially a 
monoclonal antibody, as more fully described below. 
5 Furthermore, the epitope of the glycosyltransf erase protein 
according to the present Invention may be used In assay and 
as a research reagent for purifying an antibody 
specifically bound with the epitope from a substance such 
as a supernatant derived from polyclonal serum or a 

10 cultured hybrldoma. Such an epitope or a mutant thereof 

can be produced using a technique known In the art such as 
solid phase synthesis and chemical or enzymatic cleavage of 
a protein, or using a recombinant DNA technique. 

For an antibody likely to be Induced by the 

15 glycosyltransf erase protein, both polyclonal and monoclonal 
antibodies can be prepared by a routine technique even If 
the whole protein or a portion thereof Is Isolated or the 
epitope Is Isolated. See, e.g., Kennet et al., ed.. 
Monoclonal Antibodies, Hybrldomas: A New Dimension In 

20 Biological Analyses, Plenum Press, New York, 1980. 

The present specification Is also directed to a 
hybrldoma cell line that produces a monoclonal antibody 
specific to the glycosyltransf erase protein of the present 
Invention. Such a hybrldoma can be produced and Identified 

25 by a routine technique. One method for producing such a 
hybrldoma cell line comprises : Immunizing an animal with 
the glycosyltransf erase protein; collecting spleen cells 
from the Immunized animal; fusing the spleen cells with 
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myeloma cell lines to thereby produce hybrldoma cells; and 
identifying a hybrldoma cell line producing a monoclonal 
antibody that binds to the enzyme. The monoclonal antibody 
can be collected by a routine technique. 
5 The monoclonal antibody according to the present 

. invention includes a chimeric antibody, for example, a 
humanized murine monoclonal antibody. When such a 
humanized antibody is prepared by a known technique and the 
antibody is administered to a hximan, an advantage of 
10 reduced immunogenicity may be provided. 

An antigen-binding fragment of an antibody, which can 
be produced by a routine technique, is also encompassed by 
the present invention. Examples of such a fragment include, 
but are not limited to. Fab and F(ab')2- An antibody 
15 fragment and a derivative produced by a genetic engineering 
technique are also provided. 

The antibody according to the present invention can 
be used in assay for detecting the presence of the 
glycosyltransf erase protein or a fragment thereof either 
20 In vitro or In vivo. The antibody can also be employed 

when the polypeptide of the present invention or a fragment 
thereof is purified by immunoaf f inity chromatography. 

Furthermore, the use of a binding partner, for 
example, an antibody capable of blocking the binding of the 
25 glycosyltransf erase protein to an acceptor substrate allows 
the inhibition of a biological activity generated from such 
binding. Such a blocking antibody may be identified using 
any suitable assay method such as the test of antibody for 
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the ability to inhibit the binding of the protein to a 
certain cell in which an acceptor substrate is being 
expressed. Alternatively, the blocking antibody can be 
identified in assay for the ability to inhibit biological 
5 influence arising from the enzyme according to the present 
invention bound with a binding partner for a target cell. 

By using such an antibody an In vitro method or 
administering the antibody In vivo, a biological activity 
mediated by an entity producing the antibody can be 

10 inhibited. Thus, it is possible to treat disorders caused 
or deteriorated (directly or indirectly) by the interaction 
between the glycosyltransf erase protein of the present 
invention and a binding partner. The therapy involves 
administrating, to a mammal in vivo, the blocking antibody 

15 in an effective amount for inhibiting a binding partner- 
mediated biological activity. In general, a monoclonal 
antibody is preferred for use in such therapy. In one 
aspect, an antigen-binding antibody fragment is used. 
(8) Finding of murine G9 and construction of genetically- 

20 engineered animal 

As the above -de scribed human G9 has been found, the 
present inventors have conducted a search from a gene 
database using a BLAST search and so on and consequently 
detected the presence of murine orthologs of the human G9 

25 (SEQ ID NOs: 18 to 20). 

A murine G9 nucleic acid has been found as a nucleic 
acid having ORF with 1170 bases in length (SEQ ID NO: 19) 
in a sequence with a total of 1845 bases in length (SEQ ID 
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NO: 18). This ORF Is estimated to encode an amino acid 

sequence (SEQ ID NO: 20) of a murine pi , 3-N-acetyl-D- 
glucosamlnyltransf erase protein consisting of 389 amino 
acid residues. In the present specification, every 
5 investigation and deliberation conducted on the human G9 
can similarly be applied to the murine G9. 

Based on the finding of the murine G9, the present 
invention also provides means for the expression and 
function analysis of the G9 at the animal individual level 
10 which is based on a variety of gene conversion techniques 
using fertilized eggs and ES cells and typically provides 
even a transgenic animal into which the G9 gene is 
Introduced and a knockout mouse that is deficient in the 
murine G9 . 

15 For example, the knockout mouse can be constructed 

according to a standard method in the art (see e.g.. The 
Latest Techniques for Gene Targeting, K. Yagl, ed. , 
YODOSHA; Gene Targeting, supervisory for translation by 
T. Noda, MEDICAL SCIENCES INTERNATIONAL). That is, those 

20 skilled in the art can acquire a homologously recombinant 
ES cell of the murine G9 (mG9) according to a gene 
targeting method known in the art using the sequence 
information of murine G9 nucleic acid disclosed in the 
present application and can construct a G9 knockout mouse 

25 using the ES cell (see Example 6). 

Alternatively, a method for suppressing gene 
expression by a small Interfering RNA method has recently 
been developed (T.R. Brummelkamp et al.. Science, 296, 
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550-553 (2002)), and the G9 knockout mouse can be 
constructed according to such a method known in the art. 

Providing the G9 knockout mouse would be helpful in 
elucidating the involvement of the G9 gene in a certain 
5 life phenomenon, that is, information on the redundancy of 
the gene as well as the relationship between the deficiency 
of the gene and a phenotype (including every type of 
abnormality for motion, intelligence, and sensory function) 
at an individual level and even the function of the gene in 

10 individual lifecycle such as development, growth, and aging. 
To be more specific, the knockout mouse obtained by the 
above -described method can be used to investigate the 
detection of the carriers of oligosaccharides synthesized 
by the G9 and mG9 and the relationship with physiological 

15 function and disease, and so on. For example, a 

glycoprotein and a glycolipid are extracted from each 
tissue excised from the knockout mouse and compared with 
those from a wild- type mouse by a technique such as 
proteomics (e.g., two-dimensional electrophoresis, 

20 two-dimensional thin-layer chromatography, and mass 

spectrometry) to thereby allow the identification of the 
carriers of the synthesized oligosaccharides. Moreover, 
the comparison of phenotypes (e.g., fetation, growth 
process, and spontaneous behavior) between the knockout 

25 mouse and a wild- type mouse allows the estimation of the 
relationship with physiological function and disease. 

Hereinafter, the present invention will be described 
more fully with reference to examples. However, the 
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present Invention Is not Intended to be limited to examples 
described below. 

Examples 

5 Example 1 Cloning of DNA of the present Invention 

As a result of searching a gene having homology to a 
Pl , 3-N-acetylglucosaminyltransf erase gene or a gene likely 
to encode a protein having homology to the enzyme at an 
amino acid level from a gene database using programs such 

10 as Blast [Altschul et al., J. Mol. Biol. 215 ^ 403-410 

(1990)], the FASTA method, the PSI-BLAST method, and the 
FrameSearch method [manufactured by Compugen], a human 
genome DNA sequence (AC011462: Homo sapiens chromosome 19 
clone CTC-435M10) and a EST (AW444713) sequence, and the 

15 like were found. A polypeptide encoded by each of these 
sequences Is considered to be a homolog protein of the 

pi , 3-N-acetylglucosamlnyltransf erase and was designated as 
G9. 

Unless otherwise specified in the description below, 
20 a method known in the art which is described in Molecular 
Cloning 2nd Ed. was used as a genetic engineering approach. 

RNA was extracted from a colorectal cancer cell line 
colo 205 with RNeasy Mini Kit (manufactured by Qiagen) to 
synthesize single strand DNA by an ollgo(dT) method using 
25 Super- Script First -Strand Synthesis System (manufactured by 
Invltrogen) . This DNA was used as a template to carry out 
PGR with a 5' primer (SEQ ID NO: 3) and a 3' primer (SEQ ID 
NO: 4). PGR conditions comprised 25 cycles, each cycle 
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having 94**C for 30 seconds, 65**C for 1 minute, and 72^C for 
1 minute. A DNA fragment obtained by PCR has, as 
restriction sites, Hlndlll on the 5' side of the initiation 
codon of ORF and EcoRI on the 3 ' side of the stop codon 
5 thereof . 

This DNA fragment and pBluescript (R) II SK(-) 
(manufactured by TOYOBO) were individually treated with 
restriction enzymes Hlndlll and EcoRI, with which a 
reaction solution was then mixed, followed by ligation 

10 reaction to thereby introduce the ORF of the G9 into the 
pBluescript (R) II SK(-). The reaction solution was 
purified by an ethanol precipitation method and then mixed 
with a competent cell (E. coll DH5a) . The mixture was 
subjected to a heat shock method (42''C, 30 sec.) and seeded 

15 to a LB agar medium containing IPTG and X-gal. The next 

day, a single white colony was further cultured to collect 
plasmid DNA. 

The collected plasmid DNA was confirmed to contain 
the nucleic acid sequence of interest and the nucleotide 
20 sequence was determined (SEQ ID NO: 1). A predicted open 

reading frame (ORF) in that nucleotide sequence is 1194 bp, 
and its predicted amino acid sequence consists of 397 amino 
acids (SEQ ID NO: 2). The predicted amino acid sequence 
has a hydrophobic amino acid region characteristic of a 
25 glycosyltransf erase at its N terminus. Those consisting of 
the nucleic acid sequence and the sunino acid sequence were 
designated as G9 . 

The pBluescript (R) II SK (-) into which the G9 is 
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incorporated is a multifunctional vector developed for 
carrying out cloning procedures and sequencing procedures 
more conveniently and has various improvements in addition 
to the function of conventional pUC and M13 vectors • 
5 Because a multi- cloning site is contained in a LacZ gene as 
with the pUC vector, the plasmid incorporating an insert 
therein is transformed into E. coli having the genotype of 
lacZAMlS such as XLl-Blue MRF' and JM109, which in turn 
forms a white colony in a plate supplemented with IPTG/X- 

10 gal. Thus, the presence or absence of the insert can be 

easily assessed. Moreover, because the multi-cloning site 
has a polylinker consisting of 21 restriction sites, the 
range of choices for restriction enzymes used are extended 
when a deletion mutant is created by Exo/Mung System. The 

15 incorporated G9 gene can be adjusted in its expression 
within E. coli having laclg mutation through LacZ 
operator /promoter , and the E. coli is allowed to produce a 
protein of interest by the addition of IPTG to a medium. 
Furthermore, because T3 and T7 promoters are present on 

20 both sides of that multi-cloning site, it is possible to 
create RNA probes with these promoters. BssHII sites are 
present on both sides of each of those promoters and can be 
utilized to cut out inserted DNA together with the promoter 
sequences. Using the probes from both of the promoters, 

25 gene mapping can be carried out. Because such a vector 
contains the replication origin of an fl phage, single- 
stranded DNA is produced by the infection of a VCSM13 or 
R408 helper phage and can be used in sequencing and Site 
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Specific Mutagenesis. An antlsense strand Is rescued by 
the Infection of the helper phage. 

Example 2 Expression level of DNA of the present Invention 
5 In human colorectal cancer tissue 

The expression level of the G9 gene In normal and 
colorectal cancer tissues from identical patients were 
compared using a quantitative real-time PGR method. 

The quantitative real-time PGR method is a method 

10 that combines sense and antlsense primers with a 

fluorescently- labeled probe in PGR. In amplification by 
PGR, the fluorescent label of the probe comes off and shows 
fluorescence. Fluorescent intensity is amplified in 
correlation with the amplification of a gene and as such, 

15 used as an indicator to conduct quantification. 

RNA was extracted from human colorectal cancer 
tissues and normal and colorectal cancer tissues from 
identical patients with RNeasy Mini Kit (manufactured by 
Qiagen) to synthesize single strand DNA by an oligo(dT) 

20 method using Super-Script First-Strand Synthesis System 
(manufactured by Invitrogen) . This DNA was used as a 
template to carry out quantitative real-time PGR with ABI 
PRISM 7700 (manufactured by Applied Biosystems Japan) using 
a 5' primer (SEQ ID NO: 5), a 3* primer (SEQ ID NO: 6), and 

25 a TaqMan probe (SEQ ID NO: 7). PGR conditions comprised 
reaction at 50°G for 2 minutes and 95^G for 10 minutes, 
followed by cycles repeated 50 times, each cycle having 
95^G for 15 seconds and 60°G for 1 minute. The obtained 
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measurement values were divided by a value from p- act In as 
an Internal standard gene quantified using a kit 
manufactured by Applied Blosystems Japan Ltd. , In order to 
correct variations cimong Individuals. Comparison was made 
5 between the measurement values of the hvunan colorectal 
cancer tissues and those of the normal and colorectal 
cancer tissues from Identical patients. 

The result has demonstrated that the transcript from 
the DNA of the present Invention Is not present or Is too 
10 negligible to measure In the non-cancerous tissues and that 
the transcript from the DNA of the present Invention Is 
significantly present In the cancerous tissues (see 
Table 3 ) . 

15 Table 3 



Patient 
No. 


Normal 
tissue 


Cancer 
tissue ■ 


Patient 1 


0 


0.0052 


Patient 2 


0 


0.0004 


Patient 3 


0 


0.0023 


Patient 4 


0 


0.0012 


Patient 5 


0 


0.0018 


Patient 6 


0 


0.0028 


Patient 7 


0 


0.0007 


Patient 8 


0 


0.0057 


Average 


0.0000000 


0.0025125 



Expression level of DNA of the present Invention In human 
peripheral blood 

The expression level of the G9 gene In peripheral 
20 blood of normal Individuals and patients with colorectal 
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cancer was compared using a quantitative real- time PGR 
method • 

Blood was collected Into a PAXgene blood RNA tube 
(manufactured by PreAnalytlx) from healthy volunteers and 
5 patients with colorectal cancer. After the blood was mixed 
by Inversion with reagents In the tube and reacted at room 
temperature for 24 hours, RNA was extracted with a PAXgene 
blood RNA kit (manufactured by PreAnalytlx) . Using Super - 
Script First -Strand Synthesis System (manufactured by 

10 Invltrogen), cDNA was synthesized with accompanying random 
primers • This DNA was used as templates to carry out 
quantitative real-time PGR with ABI PRISM 7700 
(manufactured by Applied Biosystems Japan) using a 5' 
primer (SEQ ID NO: 4), a 3* primer (SEQ ID NO: 5), and a 

15 TaqMan probe (SEQ ID NO: 6). PGR conditions comprised 
reaction at 50°G for 2 minutes and SS^'C for 10 minutes, 
followed by cycles repeated 50 times, each cycle having 
95**G for 15 seconds and eo'^C for 1 minute. The obtained 
measurement values were divided by a value from p-actin as 

20 an internal standard gene quantified using a kit 

manufactured by Applied Biosystems Japan Ltd. , in order to 
correct variations among individuals. Comparison was made 
between the normal individuals and the patients with 
colorectal cancer. 

25 The result has demonstrated that the transcription 

level of the DNA of the present invention In the peripheral 
blood from the patients with colorectal cancer is 
significantly greater than that in the peripheral blood 
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from the normal Individuals. Wien patients with colorectal 
cancer having the measurement value that exceeds the 
average measurement value of normal lndlvlduals+ (standard 
deviation x2) were assessed to be positive, the positive 
5 rate of the patients with colorectal cancer was 67% (see 
Table 4 ) . 



Table 4 



No. 


Normal 
Individual 


Patient with 
colorectal cancer 


Assessment 


1 


90 


121 


Positive 


2 


107 


68 


Negative 


3 


82 


199 


Positive 


4 


81 


418 


Positive 


5 


87 


123 


Positive 


6 




92 


Negative 


7 




196 


Positive 


8 




86 


Negative 


9 




473 


Positive 


10 




267 


Positive 


11 




110 


Positive 


12 




46 


Negative 


Average 


89.4 


183.3 




Standard 
deviation 


9 . 4 


132. 1 





10 Expression level of DNA of the present invention in a 
variety of human noarmal tissues 

The expression level (at cDNA) of the DNA of the 
present invention derived from human normal tissues was 
compared using a quantitative real-time PGR method in the 
15 same way as above. RNA from these tissues is commercially 
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available from Clontech Inc., and so on. The synthesis of 
cDNA employed Super- Script First -Strand Synthesis System 
(manufactured by Invitrogen) . However, pCR2 . 1 (Invitrogen) 
DNA containing a GAPDH ( glyceraldehyde- 3 -phosphate 
5 dehydrogenase) gene was used for creating the calibration 
curve of an internal standard. The pBluescript (R) II SK(-) 
vector DNA containing the ORF of the cloned G9 nucleic acid 
was used for creating the calibration curve of the G9. 

As a result, the expression of the DNA of the present 

10 invention is observed in most of the human normal tissues. 
It has revealed that the expression is relatively high in 
especially the bone marrow, the spleen, and the small 
intestine and however, the large intestine and the prostate 
have a considerably weak expression or substantially no 

15 expression (see Figure 1). 

Example 3 Expression of isolated full-length G9 

The plasmid DNA where the G9 was incorporated into 
the pBluescript (R) II SK(-) and pcDNA3.1 (manufactured by 

20 Invitrogen) were individually treated with restriction 

enzymes Hindlll and EcoRI, with which a reaction solution 
was then mixed, followed by ligation reaction to thereby 
introduce the ORF of the G9 into the pcDNA3.1(+). The 
reaction solution was purified by an ethanol precipitation 

25 method. Then, by investigating the sequence, the G9 was 
confirmed to be introduced into the pcDNA3.1{+) and this 
was designated as pcDNA3 . 1 ( + ) -G9 • The pcDNA3 . 1 ( + ) -G9 was 
mixed with a competent cell (E. coli DH5a) . The mixture 
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was subjected to a heat shock method (42^C, 45 sec.) and 
seeded to a LB agar medium containing amplclllln. The next 
day, a single colony was further cultured to collect 
plasmld DNA. The collected plasmld DNA was confirmed to 
5 contain the nucleic acid sequence of Interest and the 
nucleotide sequence was determined. 

The pcDNA3 . 1( +/- ) Is an expression vector for a broad 
variety of mammalian cells. It Is a vector for forward 
transcription. In which a sequence likely to form the 

10 secondary structure of RNA Is removed from a multl-clonlng 
site (MCS) sequence by Improving conventional pcDNA3 • 1 for 
enhancing expression level. The pcDNA3 . 1 (+/- ) has the 
enhancer/promoter of CMV and allows a high level of 
expression. RNA is stabilized by a polyadenylated signal 

15 and a transcription termination sequence. Because there 
exists a SV40 origin, the pcDNAS . 1 ( +/- ) can be replicated 
in a cell in which a SV40 Large T antigen is being 
expressed. An eunpiclllin-resistant gene is introduced 
therein for selection in E. coll. A neomycin-resistant 

20 gene is also introduced therein for selection for producing 
a stable strain in a mammalian cell. 

HCT15 cells, human colorectal cancer -derived cell 
lines, were used to conduct procedures below for creating 
G9-expressing stable strains. The HCT15 cells were 

25 suspended at 2x10^ cells in 10 ml of a RPMI-1640 medium 

containing 10% fetal bovine serum but no antibiotic, then 
seeded to a 10 -cm dish, and cultured in a CO2 incubator at 
37°C for 16 hours. The plasmld DNA (20 ng) of the 
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pcDNA3, 1(+) -G9 and 30 ^1 of Llpofectamlne 2000 
(manufactured by Invitrogen) were mixed with 1.5 ml of 
OPTI-MEM (manufactured by Invitrogen) , respectively, and 
incubated at room temperature for 5 minutes. Further, 
5 these two solutions were gradually mixed and incubated at 
room temperature for 20 minutes. This mixture solution was 
added dropwise to the dish and cultured in a CO2 incubator 
at 37*^0 for 48 hours. The cells were subcultured by a 
standard method. On this occasion, RPMI-1640 (manufactured 

10 by Invitrogen) was used as a medium, to which fetal bovine 
serum, and penicillin (manufactured by Invitrogen), 
streptomycin (manufactured by Invitrogen), and Geneticin 
(neomycin; manufactured by Invitrogen) as antibiotics were 
added. Because cells in which the pcDNA3 . l(-i-) -G9 has not 

15 been introduced are allowed to die out by the addition of 
Geneticin, continued culture results in the survival of 
only cells in which the pcDNA3 . 1(+) -G9 has been introduced. 
These cells were used as 69-expressing stable strains. 



20 Expression of G9 recombinant protein in mammalian cell line 
For obtaining a G9 recombinant protein, the G9 was 
expressed in a human kidney -derived cell line 293T. Only 
the expression of an active region from the 105th amino 
acid to the C terminus in SEQ ID NO: 2 which relatively 

25 retains homology to at least pi,3GlcNAc transferase and 
pl,3Gal transferase may be adequate for investigating a 
function. However, here, it has been decided to express 
two predicted active regions from the 24th amino acid and 

- 70 - 



the 33rd amino acid to the C terminus In the G9. 

Using, as a template, the plasmld DNA where the G9 
was Incorporated Into the pBluescrlpt (R) II SK(-), PGR 
reaction was carried out with each of 5' primers (SEQ ID 
5 NOs: 8 and 9) and a 3' primer (SEQ ID NO: 10) to obtain a 
DNA fragment of Interest. The PGR method comprised 25 
cycles, each cycle having 94°G for 30 seconds, SS^'G for 
1 minute, and 72*'C for 1 minute. Then, a PGR product was 
subjected to agarose gel electrophoresis. The gel was cut 

10 out by a gel excision method to Isolate the PGR product by 
a standard method. This PGR product had Hindi 1 1 on the 5' 
side and EcoRI on the 3 ' side as restriction sites • This 
DNA fragment and pFLAG-GMV3 were Individually treated with 
restriction enzymes Hlndlll and EcoRI, with which a 

15 reaction solution was then mixed, followed by ligation 
reaction to thereby Introduce the DNA fragment into the 
pFLAG-GMV3. The reaction solution was purified by an 
ethanol precipitation method and then mixed with a 
competent cell (E. coll DH5a) . The mixture was subjected 

20 to a heat shock method (42°G, 45 sec.) and seeded to a LB 
agar medium containing amplclllln. 

The next day, the DNA of Interest In the resulting 
colony was directly confirmed by PGR. After the DNA 
sequence was confirmed by sequencing for additional 

25 confirmation, the vector (pFLA6-CMV3) was extracted and 
purified. 

Human kidney cell -derived cell line 293T cells were 
suspended at 2x10^ cells in 10 ml of a DMEM medium 



containing 10% fetal bovine serxim but no antibiotic, then 
seeded to a lO-cm dish, and cultured In a CO2 Incubator at 
aV^'C for 16 hours. The pFLAG-CMV3-G9 (20 ng) and 30 \il of 
Llpof ectamlne 2000 (manufactured by Invltrogen) were mixed 
5 with 1.5 ml of OPTI-MEM (manufactured by Invltrogen), 
respectively, and Incubated at room temperature for 5 
minutes. Further, these two solutions were gradually mixed 
and incubated at room temperature for 20 minutes. This 
mixture solution was added dropwlse to the dish and 

10 cultured in a CO2 Incubator at 37®C for 48 hours. 

With 10 ml of the resulting culture supernatant, NaNa 
(0.05%), NaCl (150 ml), CaCla (2 ml), anti-FLAG Ml affinity 
gel (manufactured by Sigma) (100 were mixed and stirred 

overnight at 4*'C. The next day, the mixture was centrifuged 

15 (3000 rpm, 5 min, 4*^0 to collect a pellet to which 900 fxl 
of 2 ml CaCl2/TBS was in turn added. The mixture was 
centrifuged again (2000 rpm, 5 min, 4^C), and the resulting 
pellet was floated in 200 \il of 1 ml CaClz/TBS and used as 
a sample (G9 enzyme solution) for activity measurement. A 

20 portion of this was subjected to electrophoresis by SDS- 

PAGE and subsequently to western blotting with anti FLAG-M2 
peroxidase (manufactured by Sigma) , to confirm the 
expression of the G9 protein of interest. As a result, a 
band was detected at the position of approximately 45 kDa, 

25 and the expression was therefore confirmed. 



Expression of G9 recombinant protein in an Insect cell line 



For obtaining a G9 recombinant protein, the G9 was 
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expressed In an Insect cell. Only the expression of an 
active region from the 105th amino acid to the C terminus 
in SEQ ID NO: 2 which relatively retains homology to at 

least pi,3GlcNAc transferase and pi,3Gal transferase may be 
5 adequate for Investigating a function. However, here, it 
has been decided to express a predicted active region from 
the 36th amino acid to the C terminus in the G9 . 

Using, as a template, the plasmid DNA where the G9 
was incorporated into the pBluescript (R) II SK(-), PGR 

10 reaction was carried out with a 5' primer (SEQ ID NO: 11) 

and a 3' primer (SEQ ID NO: 12) to obtain a DNA fragment of 
interest. The PGR method comprised 25 cycles, each cycle 
having 94®C for 30 seconds, 65°C for 1 minute, and 72°C for 
1 minute. Then, a PGR product was subjected to agarose gel 

15 electrophoresis. The gel was cut out by a gel excision 
method to isolate the PGR product by a standard method. 
The PGR product thus isolated was incorporated into pDONR™ 
201 (manufactured by Invitrogen) by BP clohase reaction to 
create an "entry clone" . 

20 Reaction was carried out by Incubating 2 \il of the 

PGR product, 1 \il (150 ng) of the pDONR 201, 2 (xl of a BP 
reaction buffer, 3 (Jil of a Tris-EDTA buffer (pH 8.0; 
hereinafter, also abbreviated to "TE"), and 2 p.1 of BP 
clonase mix at 25**G for 1 hour. The mixture was then 

25 supplemented with 1 (il of a proteinase K (manufactured by 
Kaken Pharmaceutical) and incubated at 37*^0 for 10 minutes 
to terminate the reaction. The reaction mixture solution 

(11 (il) was mixed with 100 (xl of a competent cell (E, coll 
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DH5a) , then transformed by a heat shock method « and seeded 
to a LB plate containing kanamycln. The next day^ a colony 
was collected to confirm the Introduction of the DNA of 
interest and its nucleotide sequence by PGR. The vector in 
5 which the DNA was Inserted (pDONR-G9) was extracted and 
purified according to a standard method. The nucleotide 
sequence of the DNA inserted in this vector was confirmed 
to contain the nucleotide sequence described in SEQ ID 
NO: 1. 

10 

Preparation of expression clone 

The above -described entry clone has attL sites at 
both ends of the inserted site that are recombination sites 
when a X phage is excised from E. coll. By mixing the 

15 entry clone with a LR clonase (mixture of X phage 

recombinases Int, IHF, and Xis) and a destination vector 
(which has attR) , the inserted site was transferred to the 
destination vector to generate an expression clone. 

With 1 \il of the entry clone (pDONR-G9), 0.5 \il 

20 (75 ng) of the destination vector (pFBIF), 2 (xl of a LR 

reaction buffer, 4.5 [il of TE, and 2 \il of LR clonase mix 
(mixture solution of X phage recombinases Int, IHF, and 
Xis) were incubated at 25**C for 1 hour. The mixture was 
then supplemented with 1 (xl of a proteinase K (manufactured 

25 by Kaken Pharmaceutical) and incubated at 37^C for 10 
minutes to terminate the reaction (this recombination 
reaction yields pFBIF-G9). The pFBIF was obtained by 
inserting an IgK signal sequence and a FLAG peptide for 
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purification Into pFastBacl (manufactured by Invltrogen) 
according to a standard method. For further Inserting the 
Gateway sequence (attR) Into the pFBIF, Gateway Vector 
Conversion System (manufactured by Invltrogen) was used to 
5 Insert a conversion cassette. This conversion cassette Is 
a cassette for altering an expression vector to a 
destination vector and has an attR recombination site, a 
chloramphenlcol-reslstant gene, and a ccdB gene encoding a 
protein that Inhibits an E. coll DNA gyrase. The IgK 

10 signal sequence was Inserted for rendering an expressed 
protein secretory, while the FLAG tag was Inserted for 
facilitating purification. 

The reaction mixture solution (11 |xl) containing the 
pFBIF-G9 and 100 \il of a competent cell E. coll DH5a were 

15 mixed and transformed by a heat shock method, and the 
resulting recombinant DH5a was seeded to a LB medium 
containing ampiclllln and then cultured. After 24 -hour 
culture, a colony was collected, and the plasmld (pFBIF-G9) 
was extracted and purified by QIAprep Spin Miniprep Kit 

20 (manufactured by Qiagen) . A PGR method was used to confirm 
the Insertion of the DNA of Interest. 

Preparation of bacmld by Bac-to-Bac system (manufactured by 
Invltrogen) 

25 Subsequently, using a Bac-to-Bac system (manufactured 

by Invltrogen), recombination was conducted between the 
PFBIF-G9 and a bacmld to insert the G9 sequence into the 
bacmld capable of proliferation in an Insect cell. This 



system Is a system that utilizes the recombination site of 
Tn7 and allows the Incorporation of the gene of Interest 
{G9) Into the bacmld through a recombination protein 
produced by a helper plasmld only by Introducing, Into 
5 bacmld- containing E. coll (E. coll DHlOBac™) , pFastBac In 
which the gene of Interest Is Inserted (I.e., pFBIF-G9). 
Moreover, the bacmld contains a LacZ gene and Is selectable 
by classical colony colors (blue (without Insertion) to 
white (with Insertion)). 

10 That Is, 50 \il of the above -described purified vector 

(pFBIF-G9) and 50 \il of a competent cell (E. coll DHlOBac) 
were mixed, then transformed by a heat shock method, and 
seeded to a LB medium containing kanamycln, gentamlcln, 
tetracycline, 5-bromolndolyl-p-D-galactopyranoslde (Bluo- 

15 gal), and isopropyl-p-D- thlogalactopyranoslde (IPTG). 

After 24 hours, an Independent white colony where the DNA 
of Interest Is Inserted Into the bacmld was collected and 
further cultured to collect the bacmld according to a 
standard method. 

20 

Introduction of bacmld Into Insect cell 

The Insertion of the DNA of Interest Into the 
collected bacmld was confirmed according to a standard 
method, and the bacmld was transfected Into an Insect cell 
25 (Sf21; manufactured by Invltrogen) . That Is, a Sf900SFM 
medium (manufactured by Invltrogen) containing an 
antibiotic was added to the Sf21 cells at 9x10^ cells/2 ml 
which was then allowed to adhere to a 35 -mm Petri dish at 
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27^C for 1 hour. After the cells were confirmed to adhere 
to the Petri dish, the culture solution was aspirated. The 
cells were supplemented and Incubated at 27** for 5 hours 
with a culture solution where 800 |xl of Sf900II 
(manufactured by Invltrogen) was added to a solution of 
llpld-DNA complexes (solution obtained by gently mixing and 
Incubating, at room temperature for 30 minutes, A Solution 
(mixture of 5 (il of the above -described bacmld added to 
100 \il of Sf-900SFM) and B Solution (mixture of 6 ^1 of 
Cellfectln Reagent (manufactured by Invltrogen) added to 
100 \il of Sf-900SFM)). Then, the medium was removed, and 
the cells were supplemented with 2 ml of a SF900SFM medium 
containing an antibiotic and incubated 27**C for 72 hours. 
After culture, the cells were liberated by pipetting. The 
cells and the culture solution were collected and 
centrifuged at lOOOxg for 10 minutes to collect a 
supernatant (this supernatant was used as a "primary virus 
solution" ) . 

Further, 1x10^ Sf21 cells/20 ml of Sf-900SFM 
(containing an antibiotic) were added to a T75 culture 
flask and incubated at 27°C for 1 hour. After the adhesion 
of the cells to the flask, 800 \il of the primary virus 
solution was added and cultured at 27''C for 48 hours. After 
culture, the cells were liberated by pipetting. The cells 
and the culture solution were collected and centrifuged at 
lOOOxg for 10 minutes to collect a supernatant (this 
supernatant was used as a "secondary virus solution"). 

Further, Ixio'' Sf21 cells/20 ml of Sf-900SFM 



(containing an antibiotic) were added to a T75 culture 
flask and Incubated at Zl^'C for 1 hour. After the adhesion 
of the cells to the flask, 1000 fxl of the secondary virus 
solution was added and cultured at 27^C for 84 hours. After 
culture, the cells were liberated by pipetting. The cells 
and the culture solution were collected and centrlfuged at 
lOOOxg for 10 minutes to collect a supernatant (this 
supernatant was used as a "tertiary virus solution"). 

In addition, 100 \il of Sf-900SFM (containing an 
antibiotic) containing Sf21 cells at a concentration of 
6x10^ cells/ml and subsequently 1 ml of the tertiary virus 
solution were added to a 100-ml spinner flask and cultured 
at 27**C for 96 hours. After culture, the cells and the 
culture solution were collected and centrlfuged at lOOOxg 
for 10 minutes to collect a supernatant (this supernatant 
was used as a "quaternary virus solution"). 

NaNa, NaCl, and CaCl2 were added to 10 ml of the 
quaternary virus solution. The final concentration Is set 
to 0.05% for NaN3, to 150 mM for NaCl, and to 2 mM for CaCl2. 
To the mixture solution, 50 jjil of ant 1 -FLAG Ml antibody 
affinity gel (manufactured by Sigma) was added and gently 
mixed by Inversion at 4*^0 for 16 hours. Following 
centrlfugation (lOOOxg, 3 mln. 4°C) to remove the resulting 
supernatant, the affinity gel was washed twice with TBS 
(Trls-buffered saline, pH 7.4) containing 1 mM CaCl2. After 
washing, the affinity gel was suspended In 200 \il of TBS 
(pH 7.4) containing 1 mM CaCla, and this suspension was 
used as a G9 enzyme solution for activity measurement. 
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Example 4 Construction ofG9 enzyme protein with meunmal cell 
expression system 

(1) Construction of secretory G9 polypeptide recombinant 
As shown in the examples above, the G9 polypeptide 
5 constructed by deleting a region on the N- terminal side of 
the polypeptide was confirmed to be capable of being 
expressed as a protein in an insect cell and so on. A 
protein constructed as a FLAG peptide -fused G9 polypeptide 
by deleting a region on the N- terminal side in a similar 

10 way can also be Isolated and purified as an enzyme protein 
having an activity by a mammalian cell expression system. 

The part likely to be the catalytic region of an 
enzyme protein in the ORF of the G9 nucleic acid could be 
obtained by using, as a template, the plasmld DNA where the 

15 G9 is incorporated into the pBluescript (R) II SK(-) to 

carry out PCR reaction with a 5 ' primer having a nucleic 
acid sequence of either SEQ ID NO: 13 or 14 and a 3' primer 
having a nucleic acid sequence of SEQ ID NO: 15. The PCR 
method employed a Pfx Taq DNA polymerase (manufactured by 

20 Invltrogen) and comprised 25 cycles, each cycle having 94*'C 
for 15 seconds, 60®C for 30 seconds, and 68**C for 1 minute 
in the presence of 5 ng of the template in a total of 50 jxl 
of a reaction solution. Then, a PCR product was subjected 
to agarose gel electrophoresis. The gel was cut out by a 

25 gel excision method to Isolate the PCR product by a 
standard method, 

A recombinant DNA fragment having a nucleic acid 
sequence from nucleotide Nos. 76 to 1194 in SEQ ID NO: 1 
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was obtained by PGR reaction using the 5' primer of SEQ ID 
NO: 13 and the 3' primer of SEQ ID NO: 15. This fragment 
corresponds to that encoding an amino acid sequence from 
amino acid Nos . 26 to 397 In SEQ ID NO: 2, that Is, an 
5 amino acid sequence of SEQ ID NO: 16. 

Alternatively, a recombinant DNA fragment having a 
sequence shorter than that of the above -described DNA 
fragment was obtained by PGR reaction using the 5 ' primer 
of SEQ ID NO: 14 and the 3' primer of SEQ ID NO: 15. This 

10 fragment Is composed of a nucleotide sequence from 

nucleotide Nos . 97 to 1194 In SEQ ID NO: 1 and corresponds 
to that encoding an amino acid sequence from amino acid No. 
33 to 397 In SEQ ID NO: 2, that Is, an amino acid sequence 
of SEQ ID NO: 17. Although all of the polypeptides 

15 expressed from these recombinant DNA fragments were 
confirmed to have an enzyme activity, the longer DNA 
fragment obtained by the combination of the primers of SEQ 
ID NO: 13 and SEQ ID NO: 15 was used In the experiments 
below. 

20 By adding the signal sequence of preprotrypsln and a 

FLAG peptide ( Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys ) to the 
region of the G9 polypeptide as described above In which 
Initiation methionine has been removed, the secretion and 
expression of a FLAG peptlde-f used G9 polypeptide can be 

25 undertaken. That Is, after the DNA fragment amplified with 
the primers was treated with restriction enzymes Hindi I I 
and EcoRI, the DNA was purified and Inserted between 
Hlndlll and EcoRI In the cloning site of a pFLAG-CMV3 
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(manuf actiired by Invltrogen, Carlsbad, CA) vector, followed 
by ligation reaction to thereby create pFLAG-CMV3-G9. The 
reaction solution was purified by an ethanol precipitation 
method and then mixed with a competent cell (E, coli DH5a) . 
The mixture was subjected to a heat shock method (42**C, 45 
sec . ) and seeded to a LB agar medium containing ampicillin . 
The next day, the insertion of the DNA of interest in the 
resulting colony was directly confirmed by PCR. After the 
plasmid DNA sequence was confirmed by sequencing for 
additional confirmation, the vector (pFLAG-CMV3-G9 ) was 
extracted and purified. As described below, the obtained 
recombinant DNA can be expressed as an enzyme protein and 
purified, and its activity can be confirmed. Here, 
computer prediction and preliminary experiments, and so on 
indicated that catalytic regions in both transmembrane and 
secretory forms probably had the same exon (sequence). In 
addition, the N- terminal side is composed of a region not 
required for the temporary confirmation of an activity such 
as a transmembrane region, so that the sequence portion as 
described above was used. 

The PFLAG-CMV3-G9 plasmid created as above was 
transfected into a HEK293T cell, and the FLAG peptide-f used 
G9 polypeptide was expressed and secreted into a culture 
supernatant. The transfection method was carried out using 
Lipofect amine 2000 (manufactured by Invitrogen) according 
to the attached instruction. Specifically, human kidney 
cell-derived cell line 293T cells were suspended at 2x10® 
cells in 10 ml of a DMEM medium containing 10% fetal bovine 
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serum but no antibiotic, then seeded to a 10 -cm dish, and 
cultured In a CO2 Incubator at 37**C for 16 hours. The 
pFLAG-CMV3-G9 (20 ng) and 30^1 of Llpof ectamlne 2000 
(manufactured by Invltrogen) were mixed with 1.5 ml of 
5 OPTI-MEM (manufactured by Invltrogen), respectively, and 
Incubated at room temperature for 5 minutes. Further, 
these two solutions were gradually mixed and Incubated at 
room temperature for 20 minutes. This mixture solution was 
added dropwlse to the dish and cultured In a CO2 Incubator 

10 at 37°C for 48 hours. 

It is noted that the portion of 372 amino acid 
residues described in SEQ ID NO: 16 is the portion free 
from the FLAG and so on in the amino acid sequence of the 
G9 polypeptide portion obtained as described above. On the 

15 other hand, a polypeptide portion of SEQ ID NO: 17 is 

obtained in a similar way, but having a slightly shorter 
length Is contained In the sequence of SEQ ID NO: 16 and is 
a region consisting of 282 amino acid residues containing a 
region that is considered as an active domain from the 

20 viewpoint of homology, to a pi , 3-N-acetyl-D- 

glucosamlnyltransf erase protein known In the art. Those 
containing at least this region probably show the enzyme 
activity of interest. 

(2) Purification of G9 polypeptide secreted into culture 
25 solution 

The G9 polypeptide recombinant described above is 
secreted and expressed as a fusion protein with the FLAG 
peptide and as such, can easily be purified using antl-FLAG 
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Ml antibody affinity gel (manufactured by Sigma) • The 
purified enzyme protein may be liberated from the ant 1 -FLAG 
Ml affinity gel or otherwise may be used by being adsorbed 
in the gel. In the present excunple, the enzyme was used by 
5 being absorbed in the gel, to carry out experiments. 

NciNa, NaCl, and CaCla were added to 15 ml of the 
culture supernatant acquired as above to be brought to 0,1%, 
150 mmol/1, and 2 mmol/1, respectively, at the final 
concentration. The mixture was then supplemented with 

10 100 jil of Anti-FLAG Ml Affinity Gel (manufactured by COSMO 
Bio) and gradually stirred overnight at 4*^C. 

The next day, the mixture was centrlfuged (3000 rpm, 
5 mln, 4''C) to collect a pellet to which 900 (xl of 2 mM 
CaCl2/TBS was in turn added. The mixture was centrlfuged 

15 again (2000 rpm, 5 mln, 4''C), and the resulting pellet was 
floated in 200 \il of 1 ml CaCl2/TBS and used as a sample 
(G9 enzyme solution) for activity measurement. A portion 
of this was subjected to electrophoresis by SDS-PAGE and 
subsequently to western blotting with anti-FLAG M2 

20 peroxidase (manufactured by Sigma) to confirm the 

expression of the G9 protein of Interest. As a result, a 
band was detected at the position of approximately 45 kDa, 
and the expression was therefore confirmed. 

After washing, the gel was supplemented with 30 iil of 

25 a buffer containing 50 mmol/1 Trls-HCl (pH 7.4), 150 mmol/1 
NaCl, and 2 mmol/1 EDTA and treated at 4''C for 30 minutes 
to thereby elute the protein adsorbed in the gel. Then, 
the 10 -minute centrif ugation of the gel at 160xg yielded a 
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supernatant. After the gel was supplemented again with 30 
|il of a buffer containing 50 mmol/l Trls-HCl (pH 7.4), 150 
mmol/1 NaCl, and 2 mmol/l EDTA and treated at 4*'C for 10 
minutes, the gel was centrlfuged at 160xg for 10 minutes to 
5 thereby acquire a supernatant. Then, the above procedures 
were carried out again, and a total of 3 elutlon procedures 
were performed. The obtained eluate was supplemented with 
1 mol/1 CaCl2 to be brought to 4 mmol/l at the final 
concentration. This eluate was used as an enzyme source. 

10 

Example 5 Analysis of G9 enzyme protein for enzyme 
activity (activity measurement using glycollpld and 
synthetic monosaccharides, and so on, as substrates) 
The result of comparison with other 
15 glycosyltransf erases known In the art on the basis of the 

amino acid sequence described In SEQ ID NO: 2 has suggested 
that the G9 Is classified Into transferases In light of, 
for example, the conservation of a sequence In the 
C- terminal region considered as an active site. Therefore, 
20 for example, UDP-GlcNAc can be used as a GlcNAc donor 
substrate to confirm the enzyme activity of the enzyme 
solution containing the G9 polypeptide, which has been 
obtained in Example 4 above. 

In the present example, the activity of the G9 enzyme 
25 was measured according to almost the same method as methods 
described In the following references [1] to [3]: 

[1] Shlraishi N, Natsume A, Togayachi A, Endo T, 
Akashlma T, Yamada Y, Imal N, Nakagawa S, Koizumi S, Sekine 
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Narlmatsu H, Sasaki K. , Identification and 
characterization of three novel beta 1,3-N- 
acetylglucosamlnyltransf erases structurally related to the 
beta 1,3-galactosyltransferase family,, J Biol Chem* , 2001 
Feb 2; 276(5): 3498-507.; 

[2] Togayachl A, Akashlma T, Ookubo R, Kudo 
Nlshlhara S, Iwasakl H, Natsume A, Mio H, Inokuchl J, 
Irlmura T, Sasaki K, Narlmatsu H. , Molecular cloning and 
characterization of UDP-GlcNAc : lactosylceramlde beta 1,3-N- 
acetylglucosamlnyltransferase (beta 3Gn-T5) , an essential 
enzyme for the expression of HNK-1 and Lewis X epitopes on 
glycollplds. , J Biol chem. , 2001 Jun 22; 276(25): 22032- 
40 . ; and 

[3] Iwal Inaba N, Naundorf A, Zhang Y, Gotoh 
Iwasakl H, Kudo T, Togayachl A, Ishlzuka Y, Nakanlshi H, 
Narlmatsu H. , Molecular cloning and characterization of a 
novel UDP-GlcNAc:GalNAc- pep tide P 1,3-N 
acetylglucosamlnyltransf erase (P3Gn-T6), an enzyme 
synthesizing the core 3 structure of O-glycans., J Biol 
Chem., 2001 Jun 22; 276(25): 22032-40. 
(1) Investigation of substrate specificity 

The FLAG peptide -fused G9 polypeptide obtained In the 
example above was measured for a pi,3-N- 
acetylglucosamlnyltransf erase activity with 
monosaccharaide/ollgosaccharlde/glycollpld/glycoproteln as 
substrates according to known methods (e.g., the above- 
described references and FEBS, 462, 289 (1999), J. Biol. 
Chem. 269, 14730-14737 (1994), J. Biol. Chem., 267, 23507 
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(1992), and J. Biol. Chem. , 267, 2994 (1992)). 

pNp-a-Glc, pNp-p-Glc, pNp-a-GlcNAc, pNp-P-GlcNAc, 
pNp-a-Gal, oNp-p-Gal, pNp-a-GalNAc , pNp-a-Xyl, oNp-p-Xyl, 
pNp-a-Fuc, Bz-a-Man, Bz-a-ManNAc, corel-a-pNp, core3-a-pNp, 
5 Bz-p-lactoslde, GlcNAcpi-4GlcNAcp-Bz , and Galpl-4GlcNAca- 
pNp were used as acceptor substrates (see Table 5). These 
substrates can be purchased from, for example, Sigma Corp. 
or Toronto Research Chemicals Inc. 

10 Table 5 

Substrate specificity of G9 

Acceptor substrate Relative activity 







% 


1 


Gala-pNP* 


ND 


2 


Gaip-oNP* 


ND 


3 


GlcNAca-Bz*' 


ND 


4 


GlcNAcP-Bz" 


ND 


5 


GalNAca-pNP" 


ND 


6 


GalNAcP-pNP*> 


ND 


7 


Glca-pNP* 


ND 


8 


Glcp-pNP" 


ND 


9 


Fuca-pNP* 


ND 


10 


Xyla-pNP" 


ND 


11 


Xylp-pNP** 


ND 


12 


Mana-Bz" 


ND 


13 


Lactoslde p-Bz** 


27 


14 


Galpl-3GalNAca-pNP (core 1)° 


ND 


15 


GlcNAcpl-3GalNAca-pNP{core 2)° 


ND 


16 


Galpl-3GlcNAcP-pNP* 


ND 


17 


GlcNAcP 1 - 4GlcNAcP-Bz*' 


ND 


18 


Galpl - 4GlcNAca-pNP*' 


100 
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(In Table 5^ ND represents "not detected"; and the acceptor 
substrates marked by the superscript "a" were purchased 
from Calblochem, the acceptor substrates marked by "b" were 
purchased from Sigma- Aldrlch Corp, and the acceptor 
5 substrates marked by "c" were purchased from Toronto 
Research Chemicals Inc.) 

A basic reaction solution In the case of using a 
radiolabeled substrate as the donor substrate comprises 
20 \il In total of 14 mM HEPES buffer (pH 7.4), 10 mM MnCl2, 

10 0.15% Triton CF-54, 0.75 mM ATP, 50 \M UDP-GlcNAc 

(manufactured by Sigma), 4.5 \aM (50 nCl) [^^C]UDP-GlcNAc 
(manufactured by Amersham Biosciences), 10 |jiM substrate 
(each of the above -de scribed acceptor substrates), and an 
appropriate cimount (5 to 10 jil) of the purified enzyme 

15 protein (enzyme source obtained in Example 4). Enzyme 

reaction was carried out at 37°C for several hours up to 
16 hours (usually for 16 hours). 

After the completion of reaction, the resulting 
solution was supplemented with 200 \il of O.IM KCl and 

20 lightly centrlfuged, and then a supernatant was obtained. 
After being washed once with 10 ml of methanol, the 
supernatant was loaded onto Sep-Pak C18 Cartridge (Waters) 
equilibrated by two-time washing with 10 ml of 0.1 M KCl, 
and the substrate and a product in the supernatant were 

25 adsorbed in the cartridge. After the cartridge was washed 
twice with 1 ml of pure water for HPLC, the adsorbed 
substrate and product were eluted with 1 ml of methanol. 
Following the mixing with a liquid scintillator (ACSII; 



manufactured by Amersham Biosciences ) , the amount of 
radiation from the product was measured with a 
scintillation counter. 

As a result, the G9 polypeptide exhibited the highest 
5 transferring activity for Galpl-4GlcNAca-pNp of the 

acceptor substrates tested and also exhibited a certain 
amount of transferring activity for Bz-p-lactoslde (see 
Table 5 ) . 

(2) Investigation of enzyme reaction conditions 

10 Buffer conditions were Investigated using the same 

reaction solution and reaction time as above but using 
HEPES (pH 6.75 to 7,4) and sodliim cacodylate (pH 6,4 to 
7.2) Instead of 14 mM HEPES buffer (pH7.4) (see Figure 2A) . 
As shown In Figure 2A, the G9 polypeptide exhibited 

15 an activity in the case of using any of sodium cacodylate 
and HEPES. Generally, its activity was high at or around 
neutral. The G9 polypeptide had an activity that rises 
with an Increase In pH within the neutral region from pH 
6.4 to 7.2 in the sodium cacodylate and showed the maximum 

20 activity around pH 7.0 in the HEPES. 

Since some glycosyltransf erases require a metal ion , 
the metal ion requirement of the G9 enzyme was investigated. 
The same reaction solution and reaction time as above were 
used, and MnCl2, C0CI2, MgCl2, ZnCl2, NICI2, and CdCl2 were 

25 used Instead of 10 mM MnCl2. The concentrations of each of 
the metal ions were set to 2.5, 10, and 40 mM, and the 
respective enzyme reactions were conducted (see Figure 23). 
As shown in Figure 2B, the activity of the G9 
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polypeptide is significantly enhanced In the presence of at 
least the Mn Ion or Co Ion of the divalent metal Ions and 
particularly remarkably enhanced in the presence of the Mn 
ion. These enhanced activities rapidly rise in the low 
5 concentration region of the ion and subsequently gradually 
decrease. Although the Cd and Ni ions show slight 
enhancing effect in the low concentration region, no 
enhancing effect is substantially observed in the Mg and Zn 
ions • 

10 The above results have demonstrated that the activity 

is shown most highly under the basic reaction conditions 
used in (1) above. The same conditions were used to carry 
out reaction in experiments below. 
(3) Activity measurement in the case of using 

15 oligosaccharides pyridylaminated with 2 - aminopyr idine 
(N-glycans) as acceptor substrates 

Commercially -available PA- oligosaccharides were used 
as substrates. An oligosaccharide can be pyridylaminated 
(PA) with 2 -aminopyr idine according to a standard method 

20 (Hase, S., Ibuki, T., and Ikenaka, T., J. Biochem. 95, 

197-203 (1984)), and the PA- oligosaccharide was purchased 
from TAKARA SHUZO Co., Ltd. or SEIKAGAKU Corp. The 
specific test method is performed as follows; 

After 16 -hour reaction at 37''C in a total of 20 \il of 

25 reaction solution containing 14 mM HEPES buffer (pH 7.4), 
50 mM UDP-GlcNAc, 10 mM MnCl2, 0.15% Triton CF-54, 40 pmol 
acceptor substrate ( PA- oligosaccharide ) , and an appropriate 
amount (200 ng) of a purified enzyme protein, a product was 
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detected by high performance liquid chromatography (HPLC; 
described below In detail). The enzyme purified with the 
antl-FLAG Ml antibody affinity gel in Example 4 was used as 
the above- described purified enzyme protein. 
5 After the assay solution where the reaction was 

completed was treated at 100**C for 5 minutes, the solution 
was supplemented with 80 \il of pure water for HPLC and 
centrlfuged at 10,000xg for 5 minutes to obtain a 
supernatant. Subsequently, the supernatant was loaded onto 
10 an Ultrafree-MC column (manufactured by Mllllpore), and a 
portion (40 \xl) thereof was subjected to HPLC. The 
Ultrafree-MC column was used according to the attached 
Instruction. 

HPLC was carried out using a PALPAK Type R column 
15 (TAKARA) as a column and Eluant A: 100 mM acetic 

acid/triethylamine (pH 4.0) and Eluant B: 100 mM acetic 
acid/ trie thylamine (pH 4.0)/0.5% 1-butanol as eluants under 
such conditions that: Eluant B gradient was 5 to 55% (0 to 
60 minutes); a column temperature was 40*'C; and a flow rate 
20 was 1 ml/mln. A product was detected using a fluorescence 
spectrum photometer RF-IOAXL (manufactured by Shlmadzu) 
(excitation wavelength at 320 nm and radiation wavelength 
at 400 nm) « 

As a result, the G9 polypeptide exhibited a 
25 significant activity for tetraantennary N-glycans (PA-004 

and PA-011: Takara PA-substrate number) (see Figure 3). By 
contrast, a pi , 3-N-acetylglucosaminyltransf erase 2 (p3GnT2) 
known in the art exhibits an activity for all types (PA- 001 
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to -0011) of N-glycans, regardless of the number of an 
oligosaccharide. This also suggests that the G9 
polypeptide in the present example has a selective activity, 
that Is, It exhibits a significant activity for a 
5 tetraantennary N-glycan but exhibits no significant 

activity for monoantennary to trlantennary N-glycans. In 
the result shown In Figure 3 , the activity of the G9 
polypeptide appears relatively weak as compared to that of 
the pi , 3-N-acetylglucosamlnyltransf erase 2. However, this 

10 Is because the pi , 3-N-acetylglucosamlnyltransf erase 2 Is a 
homolog enzyme having a greatly strong enzyme activity. 
(4) Activity measurement In the case of using glycoproteins 
as acceptor substrates 

Enzyme reaction was carried out with glycoproteins as 

15 substrates. A reaction solution comprised 20 jil in total 
of 14 mM HEPES buffer (pH 7.4), 10 mM MnCla, 0.15% Triton 
CF-54, 0-75 mM ATP, 50 [M UDP-GlcNAc (manufactured by 
Sigma), 4.5 \jM (50 nCl) [^^C]UDP-GlcNAc (manufactured by 
Amersham Biosciences), 40 ^g of an acceptor substrate, and 

20 an appropriate amount (5 to 10 |xl) of a purified enzyme 

protein. A al-acld glycoprotein (orosomucoid; manufactured 
by Sigma), ovalbumin (manufactured by Sigma), or ovomucoid 
(manufactured by Sigma) was used as the above described 
acceptor substrate. The enzyme purified with the antl-FLAG 

25 Ml antibody affinity gel in Example 4 was used as the 
above -described purified enzyme protein. 

Reaction was carried out at 37°C for several hours up 
to 16 hours. A portion of a reaction product was subjected 
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to enzyme digestion with glycopeptldase F (GPF; 
manufactured by TAKARA) according to the Instruction. The 
samples both before and after enzyme digestion were 
analyzed by 10% SDS-PAGE (see Figure 4). 

As a result, the G9 polypeptide showed a GlcNAc- 
transferrlng activity for all of the glycoproteins 
(relatively weak activity for the ovalbumin) and the band 
disappeared by GPF digestion. This has demonstrated that 
such transferring reaction takes place against the N-glycan 
oligosaccharide of each glycoprotein. 

Example 6 Construction of G9 knockout mouse 

At least the ORF of an mG9 gene Is likely to have a 
single exon. Created Is a targeting vector (pBSK-mG9- 
15 KOneo) In which a chromosome fragment (approximately 10 kb) 
mainly having an approximately 10 -kb fragment containing a 
region considered as the active domain of the mG9 desired 
to be knocked out (e.g., a nucleotide sequence from 
nucleotide No. 97 to 1194 In SEQ ID NO: 1) was Inserted 
20 Into pBluescrlpt (R) II SK (-) (manufactured by TOYOBO) . 

The pBSK-mG9 -KOneo employs neo (neomycin -resistant gene) as 
a drug-resistant gene and lacks the predicted GlcNAc- 
transf erring activity region of the mG9, and this lacking 
portion Is replaced with the neo. After the pBSK-mG9 -KOneo 
25 thus obtained Is rendered linear with a restriction enzyme 
NotI, Its 80-fxg aliquot Is transfected (e.g. 
electroporated) Into ES cells (derived from an E14/129Sv 
mouse) to pick G418-reslstant colonies. The G418-reslstant 
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5 



10 



colonies are transferred to a 2 4 -well plate and cultured. 
After some of the cells are cryopreserved, DNA Is extracted 
from the remaining ES cells, and approximately 120 colonies 
of clones that have undergone recombination are selected by 
5 PGR, In addition, PGR and southern blotting, and so on, 

are used to ascertain that recombination occurs as planned. 
Ultimately, 10 clones of recombinants are selected. The ES 
cells from two of the selected clones are injected into the 
blastocyst of a G57BL/6 mouse. The murine embryo in which 
10 the ES cell has been injected is transplanted into the 
uterus of a foster mother mouse to generate a chimeric 
mouse. Then, a he tero- knockout mouse can be obtained by 
germ transmission. 
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